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To Donna 




Preface 



This book presents the basic theory of fields, starting more or less 
from the beginning. It is suitable for a graduate course in field theory, 
or independent study. The reader is expected to have absorbed a serious 
undergraduate course in abstract algebra, not so much for the material 
it contains but for the oft-mentioned mathematical maturity it provides. 

The book begins with a preliminary chapter (Chapter 0), which is 
designed to be quickly scanned or skipped and used as a reference if 
needed. The remainder of the book is divided into three parts. 

Part 1, entitled Basic Theory , begins with a chapter on polynomials. 
Chapter 2 is devoted to various types of field extensions. In Chapter 3, 
we treat algebraic independence, starting with the general notion of a 
dependence relation and concluding with Luroth’s Theorem on 
intermediate fields of a simple transcendental extension. Chapter 4 is 
devoted to the notion of separability of algebraic extensions. 

Part 2 of the book is entitled Galois Theory. Chapter 5 begins with 
the notion of a Galois correspondence between two partially ordered 
sets, and then specializes to the Galois correspondence of a field 
extension, concluding with a brief discussion of the Krull topology. In 
Chapter 6, we discuss the Galois theory of equations. In Chapter 7, we 
take a closer look at a finite field extension E of F as a vector space 
over F. The next two chapters are devoted to a fairly thorough 
discussion of finite fields. Mobius inversion is used in a few brief spots 
in these chapters, so an appendix has been included on this subject. 

Part 3 of the book is entitled The Theory of Binomials. Chapter 10 
covers the roots of unity (that is, the roots of the binomial x n — 1) and 
includes Wedderburn’s theorem (a finite division ring is a field). This 
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viii 



also seems like the appropriate time to discuss the question of whether a 
given group is the Galois group of a field extension. In Chapter 11, we 
characterize the splitting fields of binomials x n — u, when the base field 
contains the n-th roots of unity. Chapter 12 is devoted to the question 
of solvability of a polynomial equation by radicals. (This chapter might 
make a convenient ending place in a graduate course.) In Chapter 13, 
we determine conditions that characterize the irreducibility of a 
binomial and describe the Galois group of a binomial. Chapter 14 
briefly describes the theory of families of binomials — the so-called 
Kummer theory . 

Sections marked with an asterisk are optional, in that they may be 
skipped without loss of continuity. The unmarked sections might be 
considered as forming a basic core course in field theory. 
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Chapter 0 

Preliminaries 



The purpose of this chapter is to review some basic facts that will be 
needed in the book . The discussion is not intended to be complete, nor 
are all proofs supplied . We suggest that the reader quickly skim this 
chapter (or skip it altogether) and use it as a reference if needed. 



0.1 Lattices 

Definition A partially ordered set (or poset) is a nonempty set P, 
together with a binary relation < on P satisfying the following 
properties. For all a, /?, 7 E P, 

1 ) (reflexivity) a < a 

2) (antisymmetry) <*</?, /3 < a a = /3 

3) (transitivity) <*</?, /?< 7 =>a <7 

If, in addition, 

a, j8GP^a<j3or/?<a 

then P is said to be totally ordered. □ 

Any subset of a poset P is also a poset under the restriction of the 
relation defined on P. A totally ordered subset of a poset is called a 
chain. If S C P and s < ot for all s E S then a is called an upper bound 
for S. A least upper bound for S, denoted by lub(S) or V S, is an upper 
bound that is less than or equal to any other upper bound. Similar 
statements hold for lower bounds and greatest lower bounds, the latter 
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denoted by glb(S), or A S. A maximal element in a poset P is an 
element a G P such that a < /3 implies a = /?. A minim al element in a 
poset P is an element 7 G P such that ft < 7 implies /? = 7. Zorn’s 
Lemma says that if every chain in a poset P has an upper bound in P 
then P has a maximal element. 

Definition A lattice is a poset L in which every pair of elements a, 
P G L has a least upper bound, or join, denoted by a V /? and a greatest 
lower bound, or meet, denoted by a A /?. If every nonempty subset of L 
has a join and a meet then L is called a complete lattice. □ 

Note that any nonempty complete lattice has a greatest element, 
denoted by 1 and a smallest element, denoted by 0. 

Definition A sublattice of a lattice L is a subset S of L that is closed 
under meets and joins. 0 

It is important to note that a subset S of a lattice L can be a lattice 
under the same order relation and yet not be a sublattice of L. As an 
example, consider the collection If of all subgroups of a group G, 
ordered by inclusion. Then If is a subset of the power set ^(G), which is 
a lattice under union and intersection. But If is not a sublattice of ®P(G) 
since the union of two subgroups need not be a subgroup. On the other 
hand, If is a lattice in its own right under set inclusion, where the meet 
H A K of two subgroups is their intersection and the join H V K is the 
smallest subgroup of G containing H and K. 

In a complete lattice L, joins can be defined in terms of meets: V T 
is the meet of all upper bounds of T. The fact that 1 G L insures that T 
has at least one upper bound, so that the meet is not an empty one. 
The following theorem exploits this idea to give conditions under which 
a subset of a complete lattice is itself a complete lattice. 

Theorem 0.1.1 Let L be a complete lattice. If S C L has the properties 
(i) 1 G S and (ii) T C S, T ^ 0 => ATgS, then S is a complete lattice. 

Proof. Let TCS. Then A T G S by assumption. Let U be the set of all 
upper bounds of T that lie in S. Since 1 G S, we have U ^ 0 . Hence, 
A U G S and is V T. Thus, S is a complete lattice. (Note that S need 
not be a sublattice of L since A U need not equal the meet of all upper 
bounds of T in L.) I 
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0.2 Groups 

Definition A binary operation on a set A is a map from A x A to A. 0 

Definition A group is a nonempty set G, together with a binary 
operation on G, denoted by juxtaposition, with the following properties: 

1) (Associativity) (a/3)y = a(/3y) for all a, /?, y £ G; 

2) (Identity) There exists an element e G G for which ea = ae = a for 
all a G G; 

3) (Inverses) For each qGG, there is an element a -1 G G for which 
aa~ l = a -1 a = e. 

A group G is abelian, or commutative, if a/3 = /?a, for all a, /3 G G. D 

The identity element is often denoted by 1. When G is abelian, the 
group operation is often denoted by + and the identity by 0. 

Definition A subgroup S of a group G is a subset of G that is a group in 
its own right, using the restriction of the operation defined on G. We 
denote the fact that S is a subgroup of G by writing S < G. D 

Let G be a group. Since G is a subgroup of itself and since the 
intersection of subgroups of G is a subgroup of G, Theorem 0.1.1 
implies that the set of subgroups of G forms a complete lattice, where 
H A J = H f! J and H V J is the smallest subgroup of G containing both 
H and J. We denote this lattice by If (G). 

A group G is finite if it contains only a finite number of elements. 
The cardinality of a finite group G is called its order and is denoted by 
| G | or o(G). If a G G, and if a k = e for some integer k, we say that k 
is an exponent of a. The smallest positive exponent for a G G is called 
the order of a and is denoted by o(c*). An integer m for which a m = 1 
for all a G G is called an exponent of G. (Note: Some authors use the 
term exponent of G to refer to the smallest positive exponent of G.) 

Theorem 0.2.1 Let G be a group and let a G G. Then k is an exponent 
of a if and only if k is a multiple of o(a). Similarly, the exponents of G 
are precisely the multiples of the smallest positive exponent of G. 0 

While the smallest positive exponent of an element a G G is the 
order of the cyclic subgroup (a) = {a 11 | n G Z}, this does not extend to 
groups in general, that is, the smallest positive exponent of G may be 
smaller than the order of G. (Example: Z 2 x Z 2 has exponent 2 but 
order 4.) We next characterize the smallest positive exponent for finite 
abelian groups. 
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Theorem 0.2.2 Let G be a finite abelian group. 

1) If m is the maximum order of all elements in G then a m = 1 for 
all a E G. Thus, the smallest positive exponent of G is equal to 
the maximum order of all elements of G. 

2) The smallest positive exponent of G is equal to o(G) if and only if 
G is cyclic. 

Proof. Let a have maximum order m among all the elements in G. 
Suppose that f3 m ^ 1 for some (3 E G and let o(/3 ) = k < m. It follows 
that k/m and so there exists a prime p for which p u | k but p u /m. Let 
v < u be the largest integer for which p v | m. Consider the elements 

a' = a p and /?' = /? k ^ p 

Since o(a') = m/p v and o(/3') = p u and since (m/p v ,p u ) = 1, it follows 
that 

o(a'/?') = o{a , )o{P') = mp u-v > m 

in contradiction to the maximality of m. Thus, all elements (3 E G 
satisfy f3 m = 1. Clearly, m = o(a) is the smallest such positive integer 
and part 1) is proved. Part 2) follows easily from part 1), since a finite 
group G is cyclic if and only if it has an element of order o( G). I 

Let H < G. We may define an equivalence relation on G by saying 
that a ~ j3 if /? _1 a E H (or equivalently a"" 1 /? E H). The equivalence 
classes are the left cosets aH = {ah | h E H} of H in G. Thus, the 
distinct left cosets of H form a partition of G. Similarly, the distinct 
right cosets Ha form a partition of G. It is not hard to see that all 
cosets of H have the same cardinality and that there are the same 
number of left cosets of H in G as right cosets. (This is easy when G is 
finite. Otherwise, consider the map aH»— ►Ha” 1 .) 

Definition The index of H in G, denoted by (G:H), is the cardinality of 
the set G/H of all distinct left cosets of H in G. If G is finite then 
(G:H) = | G | / | H | . D 

Theorem 0.2.3 Let G be a finite group. 

1) (Lagrange) The order of any subgroup of G divides the order of G. 

2) The order of any element of G divides the order of G. 

3) (Converse of Lagrange’s Theorem for Finite Abelian Groups) If A 
is a finite abelian group and if k | o(A) then A has a subgroup of 
order k. □ 
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Normal Subgroups 

Definition A subgroup H of G is normal in G, written H < G, if 
ftHa” 1 = H for all a E G. D 

Definition A group G is simple if it has no normal subgroups other than 
{1} and G. 0 

Theorem 0.2.4 The following are equivalent for a subgroup H of G. 

1) H<G. 

2) aH = Ha for all a E G. 

3) For all a E G, there exists a /? E G such that aH = H /?. 

4) aHa' 1 C H for all a E G. 

5) a/? E H => /?a E H for all a, /? E G. □ 

Theorem 0.2.5 Any subgroup H of a group G of index 2 is normal. □ 

Theorem 0.2.6 If G is a group and {HJ is a collection of normal 
subgroups of G then fl Hj and V Hj are normal subgroups of G. Hence, 
the collection of normal subgroups of G is a complete sublattice of the 
complete lattice if(G) of all subgroups of G. □ 

Theorem 0.2.7 If H < G then the set G/H of all right cosets of H in G 
forms a group under the operation (aH)(/?H) = a/?H if and only if 
H <3 G. The group G/H is called the quotient group (or factor group) of 
H in G. The order of G/H is (G:H). □ 

Euler's Formula 

If a and /? are integers, not both zero, then an integer 6 is called a 
greatest common divisor (gcd) of a and /? if (i) 6 | a and 6 | /? and (ii) if 
7 | a and 7 | /? then 7 | 6. Note that if 6 is a gcd of a and /?, then so is 
-6. It is customary to denote a gcd of a and /? by (a, ft) or gcd (a,/?). 

If (a,/?) = 1, then a and /? are relatively prime. The Euler phi 
function <f) is defined by letting <j>( n) be the number of positive integers 
less than or equal to n that are relatively prime to n. The Euler phi 
function is multiplicative , that is, 

<£(mn) = <^(m)<£(n), when (m,n) = 1 

It also satisfies 

<£(p n ) = P n-1 (P - 1). P prime, n > 0 
These two properties completely determine <j>. 
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Two integers a and (3 are congruent modulo n, written a = f3 mod n, 
if a — (3 is divisible by n. Let Z n denote the ring of integers {0,...,n-l} 
under addition and multiplication modulo n. 

Theorem 0.2.8 (Euler’s Theorem) If c*, n G Z and (a,n) = 1, then 

a^( n ) = 1 mod n 

Proof. We first show that the set G = {(3 G Z n | (/?,n) = 1} is a group of 
order <£(n) under multiplication modulo n. Clearly, /?,7 G G imply /3j G 
G. Also, if /? G G, then there exists a, b G Z such that a/? + bn = 1 and 
so a/? = 1 mod n. Thus, a modn is the inverse of (3 G G. Since G is a 
group of order <^(n), we deduce that or ^ = 1 mod n, for all a G G. If 
a £ G, then there exists an a' G G for which a' = a mod n. Since 
(a,n) = 1 if and only if (a , ,n) = 1, we have 

= 1 mod n I 

Corollary 0.2.9 (Fermat’s Theorem) If p is a prime not dividing the 
integer a, then 

a p = a mod p 0 



Cyclic Groups 

If G is a group and a G G, then the set of all powers of a 

(a) = {a 11 1 n G Z} 

is a subgroup of G, called the cyclic subgroup generated by or. A group 
G is cyclic if it has the form G = (a), for some a G G. In this case, we 
say that a generates G. 

Theorem 0.2.10 Every subgroup of a cyclic group is cyclic. A finite 
abelian group G is cyclic if and only if its smallest positive exponent is 
equal to o(G). □ 

The following theorem contains some key results about finite cyclic 
groups. 

Theorem 0.2.11 Let G 

1) For 1 < k < n, 



= (a) be a cyclic group of order n. 



o(a k ) = 

’ (n,k) 
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In particular, a k generates G = (a) if and only if (n,k) = 1. 

2) If d | n, then 

o(a k ) = d k = r g, where (r,d) == 1 

Thus the elements of G of order d | n are the elements of the form 
a rn / d , where 0 < r < d and r is relatively prime to d. 

3) For each d | n, the group G has exactly one subgroup H d of order 
d and <j>(d) elements of order d, all of which lie in H d . 

Proof. To prove part 1), we first observe that if d = (k,n) then d = 
ak + bn for some integers a and b. Hence, 

<x d = (a k ) a G <a k ) 

whence (c* d ) C (c* k ). But the reverse inclusion holds since d | k and so 
(a k ) = (a d ). Since d | n, it is clear that 

«•» = ^>=3= Wo 

To prove part 2), we let d | n and solve the equation 

(ik) = d 

Rearranging gives 

n = d(n,k) = (dn,dk) 

Setting r = k/(n,k), we get dk = n[k/(n,k)] = nr and so 

n = (dn,rn) = n(d,r) 

which holds if and only if (d,r) = 1. 

For part 3), it follows from part 2) that all of the <^(d) elements of G 
of order d lie in the subgroup H d = (a n / d ). Moreover, if H is a subgroup 
of G of order d then, being cyclic, it must contain an element (3 of order 
d. But p G H d and so H = (ft) = H d . I 

Counting the elements in a cyclic group of order n gives the following 
corollary. 

Corollary 0.2.12 For any positive integer n, 

d | n 



D 
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Homomorphisms 

Definition Let G and H be groups. A map xp:G—>R is called a group 
homomorphism if xp(al3) = (\pa)(xp(3). A surjective homomorphism is an 
epimorphism, an injective homomorphism is a monomorphism and a 
bijective homomorphism is an isomorphism. If xp:G—>K is an 
isomorphism, we say that G and H are isomorphic and write G ~H. D 

If ^ is a homomorphism then xpe = e and xpa" 1 = (xpa)* 1 . The kernel 
of a homomorphism xp:G— >H, 

ker xp = {a E G | xpa = e} 

is a normal subgroup of G. Conversely, any normal subgroup H of G is 
the kernel of a homomorphism. For we may define the natural 
projection 7r:G— >G/H by ira = aH. This is easily seen to be an 
epimorphism with kernel H. 

Let f:S— »T be a function from a set S to a set T. Let ^(S) and ^P(T) 
be the power sets of S and T, respectively. We define the induced map 
f:?P(S)— ^(T) by f(U) = {f(u) | u £ U} and the induced inverse map by 
f~ 1 (V) = {s E S | f(s) E V}. (It is customary to denote the induced maps 
by the same notation as the original map.) Note that f is surjective if 
and only if its induced map is surjective, and this holds if and only if 
the induced inverse map is injective. A similar statement holds with the 
words surjective and injective reversed. 

Theorem 0.2.13 Let ^:G-+G' be a group homomorphism. 

1) a) If H < G then ^(H) < G # . 

b) If ip is surjective and H < G then H ) < G'. 

2) a) If H' < G' then ^(H') < G. 
b) If H' <i G' then V ,_1 ( H ') < G. □ 

Theorem 0.2.14 (The Isomorphism Theorems) Let G be a group. 

1) (First Isomorphism Theorem) Let t/>:G— >G' be a group 
homomorphism with kernel K. Then K < G and the map 
xp:G/K-*imip defined by \p(aK) = ipa is an isomorphism. Hence 
G/K ~ im ip. In particular, xp is injective if and only if kerxp = 

w- 

2) (Second Isomorphism Theorem) If H < G and N < G then 
N n H < H and 

_H NH 

N n H — N 
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3) (Third Isomorphism Theorem) If H < I « J < G then I/H < J /II 

and J/H j 

I/H ~ I 

Hence (J:I) = (J/H:I/H). Q 

Theorem 0.2.15 (The Correspondence Theorem) Let H < G and let tt be 

the natural projection 7r:G— >G/H. Thus, for any I < G, 

tt(I) = I/H = {iH | i € 1} 

1) The induced maps 7r and 7r _1 define a one-to-one correspondence 
between the lattice of subgroups of G containing H and the lattice 
of subgroups of G/H. 

2) 7 r preserves index, that is, for any H<I<J<G, we have 

(J:I) = WJ):*(I)) 

3) 7r preserves normality, that is, if H < I < J < G then I < J if and 
only if I/H < J/H, in which case J /I ~ 7r(J)/7r(I). 0 



Action of a Group on a Set 

Definition Let X be a set and let G be a group. We say that G acts on 
X if there is a function G x X— >X, sending (a,x) to ax G X, satisfying 

1) lx = x for all x G X 

2) (a/?)x = a(/?x) for all x G X, a, /? G G. 

We say that G acts transitively on X if for any x, y G X there exists an 
a G G such that ax = y. 0 

It follows from the definition that each a G G acts as a permutation 
7r a :xh->ax of X and that the map ai->7r a is a group homomorphism from 
G to a subgroup of the group of permutations of X. 

Definition Let G act on X. The orbit of x G X is the set 

orb(x) = Gx = {ax | a G G} 

The stabilizer of x is the subgroup 



G x = {a G G | ax = x} 



0 
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Note that G acts transitively on X if and only if orb(x) = X for all 
x G X. We may define an equivalence relation on X by setting x ~ y if 
and only if there exists an a G G for which ax = y. The equivalence 
classes are precisely the orbits in X, which therefore partition the set X. 
Since ax = /3x if and only if (3~ l a G G x , which in turn holds if and only 
if aG x = /?G X , we deduce the existence of a bijection from G/G x onto 
orb(x). 

Theorem 0.2.16 Let G act on X. 

1) For any x G X, | orb(x) | = (G:G X ) and if X is finite then 
I orb(x) | = | G | / | G x | . 

2) If G acts transitively on X then | X | = (G:G X ) for any x G X and 
if X is finite then | X | = | G | / | G x | . 

3) (The class equation) 

I X | = £(G:G x ) 

where the sum is taken over one representative from each distinct 
orbit in X. 0 

Example 0.2.1 One of the most important instances of a group acting 
on a set is the case where X = G acts on itself by conjugation. To avoid 
obvious confusion, we denote the action of a G G on ft G G by a/?. Then 
a/3 = after 1 . The orbit of ft G G is the conjugacy class of ft 

orb(ft) = { afta -1 | a G G} 

The stabilizer of ft G G is the centralizer of ft 

C(ft) = {aeG\aft = fta} 

The previous theorem says that the conjugacy class of ft has cardinality 
(G:C(/?)). The class equation in this case is 

o(G) = £(G:C(/?)) 

where the sum is over one representative of each conjugacy class. 

The center of G is the set Z(G) = {ft G G | aft = fta for all a G G}. 
Thus Z(G) consists of those elements of G whose centralizer is equal to 
the entire group G, or equivalently, whose conjugacy class contains only 
the element itself. In other words, ft G Z(C) if and only if (G:C(/?)) = 1. 
We may now write the class equation in the form 



o(G) = o(Z(G))+£(G:C(/?)) 
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where the sum is taken over one representative from each conjugacy 
class of size greater than 1. 0 



Sylow Subgroups 

Definition If p is a prime, then a group G is called a p-group if every 
element of G has order a power of p. 0 

For finite groups, if a E G then o(a) | o(G). The converse does not 
hold in general, but we do have the following. 

Theorem 0.2.17 Let G be a finite group. 

1) (Cauchy) If o(G) is divisible by a prime p then G contains an 
element of order p. 

2) If p is a prime and o(G) is divisible by p n then G contains a 
subgroup of order p n . □ 

Corollary 0.2.18 A finite group G is a p-group if and only if | G | = p n 
for some n. 0 

Theorem 0.2.19 (Sylow) If G has order p n m where p/m then G has a 
subgroup of order p n , called a Sylow p-subgroup of G. All Sylow p- 
subgroups are conjugate (and hence isomorphic). The number of Sylow 
p-subgroups of G divides o(G) and is congruent to 1 mod p. Any p- 
subgroup of G is contained is a Sylow p-subgroup of G. D 



The Symmetric Group 

Definition The symmetric group S n is the group of all permutations of 
the set A = {1,2,. . .,n}, under composition of maps. A transposition is 
a permutation that interchanges two distinct elements of A and leaves 
all other elements fixed. The alternating group A n is the subgroup of S n 
consisting of all even permutations, that is, all permutations that can 
be written as a product of an even number of transpositions. 0 

Theorem 0.2.20 

1) The order of S n is n!. 

2) The order of A n is n!/2. Thus, [S n :AJ = 2 and A n < S n . 

3) A n is the only subgroup of S n of index 2. 

4) A n is simple (no nontrivial normal subgroups) for n > 5. 0 
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A subgroup H of S n is transitive if for any k, j £ {l,2,...,n} there is 
a a £ H for which crk = j. 

Theorem 0.2.21 If H is a transitive subgroup of S n then o(H) is a 
multiple of n. 

Proof. The group H acts on the set X = {1,2,. . .,n} and Theorem 0.2.16 
gives | X | = | H | / | G x | , that is, | H | = n | G x | . I 



0.3 Rings 

Definition A ring is a nonempty set R, together with two binary 
operations on R, called addition (denoted by +), and multiplication 
(denoted by juxtaposition), satisfying the following properties. 

1) R is an abelian group under the operation + . 

2) (Associativity) (a(3)y = a((3y) for all a, /?, 7 £ R. 

3) (Distributivity) For all a, (3, 7 £ R, 

(a + /?) 7 = ay + a/3 and 7(0 + /?) = ya -f y/3 0 

Definition Let R be a ring. 

1) R is called a ring with identity if there exists an element 1 £ R for 
which al = la = a, for all a £ R. In a ring R with identity, an 
element a is called a unit if it has a multiplicative inverse in R, 
that is, if there exists a j3 £ R such that a/3 = (3 a = 1. 

2) R is called a commutative ring if multiplication is commutative, 
that is, if afi = /3a for all a, /? £ R. 

3) A zero divisor in a commutative ring R is a nonzero element a £ 
R such that a/3 = 0 for some (3 ^ 0. A commutative ring R with 
identity is called an integral domain if R contains no zero divisors. 

4) A ring R with identity 1 ^ 0 is called a field if the nonzero 
elements of R form an abelian group under multiplication. □ 

It is not hard to see that the set of all units in a ring with identity 
forms a group under multiplication. We shall have occasion to use the 
following example. 

Example 0.3.1 Let Z n = {0,...,n-l} be the ring of integers modulo n. 
Then k is a unit in Z n if and only if (k,n) = 1. This follows from the 
fact that (k,n) = 1 if and only if there exists integers a and b such that 
ak + bn = 1, that is, if and only if ak = 1 mod n. The set of units of Z n , 
denoted by Z*, is a group under multiplication. D 
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Definition A subring of a ring R is a nonempty subset S of R that is a 
ring in its own right, using the same operations as defined on R. 0 

Definition A subfield of a field E is a nonempty subset F of E that is a 
field in its own right, using the same operations as defined on E. In this 
case, we say that E is an extension of F and write F <EorE>F.O 

Definition Let R and S be rings. A function rp: R— »S is a homomorphism 
if, for all a, (3 E R, 

ip(a + (3) = xpa + xp(3 and = (rpa)(xpf3) 

An injective homomorphism is a monomorphism or an embedding, a 
surjective homomorphism is an epimorphism and a bijective 
homomorphism is an isomorphism. A homomorphism from R into itself 
is an endomorphism and an isomorphism from R onto itself is an 
automorphism. □ 



Ideals 

Definition A nonempty subset 3 of a ring R is called an ideal if it 
satisfies 

1) a, /? E 3 implies a — /? E 3. 

2) a E R, t E 3 implies ai E 3 and ta E 3. D 

If S is a nonempty subset of a ring R, then the ideal generated by S 
is defined to be the smallest ideal 3 of R containing S. If R is a 
commutative ring with identity, and if a E R, then the ideal generated 
by {a} is the set 

(a) = Ra = {pa | p E R} 

Any ideal of the form (a) is called a principal ideal. 

Definition If ipiR—tS is a homomorphism, then 
Kerxp = {a E R | 4>a = 0} 

is an ideal of R. 0 



If R is a ring and 3 is an ideal in R then for each a E R, we can form 
the coset 

a + 3 = {a + * E 3} 
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It is easy to see that a + 3 = /?-f3if and only if a — (3 £ 3, and that any 
two cosets a + 3 and /? + 3 are either disjoint or identical. The collection 
of all (distinct) cosets is a ring itself, with addition and multiplication 
defined by 

(a + 3) + (b + 3) = (a + b) + 3 

and 

(a + 3)(b 4- 3) = ab + 3 

The ring of cosets of 3 is called a factor ring and is denoted by R/3. 

Definition An ideal 3 of a ring R is maximal if 3 ^ R and if whenever 
3 Q 3 Q R for any ideal then j = 3 or $ = R. An ideal 3 is prime if 
3 ^ R and if ot(3 £ 3 implies a £ 3 or /? £ 3. 0 

It is not hard to see that a maximal ideal in a commutative ring 
with identity is prime. This also follows from the next theorem. 

Theorem 0.3.1 Let R be a commutative ring with identity and let 3 be 
an ideal of R. 

1) R/3 is a field if and only if 3 is maximal. 

2) R/3 is an integral domain if and only if 3 is prime. Q 

The Characteristic of a Ring 

Let R be a ring and let r £ R. For any positive integer n, we define 



nr = r + rH hr 

' 

n terms 

The characteristic char(R) of a ring R is the smallest positive integer n 
for which nl = 0 (or equivalently, nr = 0 for all r € R), should such an 
integer exist. If it does not, we say that R has characteristic 0. If 
char(R) = 0 then R contains a copy of the integers Z, in the form 
Z • 1 = {nl | n £ Z}. If char(R) = r, then R contains a copy of Z r = 

Theorem 0.3.2 The characteristic of an integral domain is either 0 or a 
prime. In particular, a finite field has prime characteristic. 0 

If F is a field, the intersection of all of its subfields is the smallest 
subfield of F and is referred to as the prime subfield of F. 
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Theorem 0.3.3 If char(F) = 0, the prime subfield of F is isomorphic to 
the rational numbers Q. If char(F) = p is prime, the prime field of F is 
isomorphic to Z p . 0 

The following result is of considerable importance for the study of 
fields of nonzero characteristic. 

Theorem 0.3.4 Let R be a commutative ring with identity of prime 
characteristic p. If q = p n then 



(a + /?) q = a q + /? q , (a - /?) q = a q - /? q 



Proof. Since the binomial formula holds in any commutative ring with 
identity, we have 

(a + /?)P=£(|V/? p - k 

. k=0 VK/ 

where 

(P\ _ p(p~ 1)’ ' •(p-k+l) 

\k)~ k! 



But p j for 0 < k < p, and 
therefore reduces to 



so 



p 



= 0 in R. The binomial formula 



(a + /?)P = aP + /?P 



Repeated use of this formula gives (a + /?) q = a q + /? q . The second 
formula is proved similarly. I 



0.4 Integral Domains 

Theorem 0.4.1 Let R be an integral domain. Let a, /? E R. 

1) We say that a divides /? and write a | /? if /? = pa for some p 6 R. 

If p and a are nonunits and /? = pa then a properly divides /?. 

a) A unit divides every element of R. 

b) a | /? if and only if (/?) C (a). 

c) a | (3 properly if and only if (/?) C (a) C R. 

2) If a = uf3 for some unit u then a and f3 are associates and we 
write a ~ /?. 

a) a ~ f} if and only if a \ f) and (3 | a. 

b) a ~ /3 if and only if (a) = (/?). 

3) A nonzero element p £ R is irreducible if p is not a unit and if p 
has no proper divisors. Thus, a nonunit p is irreducible if and only 
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if p = a(3 implies either a or /? is a unit. 

4) A nonzero element 7r E R is prime if 7 r is not a unit and whenever 
7 r | a/3 then tt \ a or 7r | /?. 

a) Every prime element is irreducible. 

b) 7r E R is prime if and only if (n) is a nonzero prime ideal. 

5) Let a, P E R. An element d E R is called a greatest common 
divisor (gcd) of a and f3 , written (a,/?) or gcd(a,/?), if d | a and 
d | P and if whenever e | a, e | /? then e | d. If gcd(a,/?) is a unit, we 
say that a and (3 are relatively prime. 

a) The greatest common divisor of two elements, if it exists, is 
unique up to associate. □ 

Theorem 0.4.2 An integral domain R is a field if and only if it has no 
ideals other than the zero ideal or R itself. Any nonzero homomorphism 
<r:F— *E of fields is a monomorphism. □ 

Theorem 0.4.3 Every finite integral domain is a field. 0 

If R is an integral domain, we may form the set 

R' = | a, p E R, P * 0} 

where a//? = a/b if and only if ab = a/?. We define addition and 
multiplication on R' in the “obvious way” 

a , a _ ftb -f P& a a _ oa 
j0 + b“ fib ’ /?’b~/?b 

It is easy to see that these operations are well-defined and that R' is 
actually a field, called the field of quotients of the integral domain R. It 
is the smallest field containing R, in the sense that if F is a field and 
RCF then R C R ; C F. The following fact will prove useful. 

Theorem 0.4.4 Let R be an integral domain with field of quotients R\ 
Then any monomorphism a: R— >F from R into a field F has a unique 
extension to a monomorphism ^:R'— >F. 

Proof. Define a(a/(3) = cra/cr/?, which makes sense since (3 / 0 implies 
cr(3 ^ 0. One can easily show that a is well-defined. Since aot/i t(3 = 0 if 
and only if era = 0, which in turn holds if and only if ot/P — 0, we see 
that a is injective. Uniqueness is clear since a | R (a restricted to R) 
uniquely determines a on R'. I 
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0.5 Unique Factorization Domains 

Definition An integral domain R is a unique factorization domain (ufd) 
if 

1) Any nonunit r 6 R can be written as a product r = pj* • -p n where 
Pj is irreducible for all i. We refer to this as the factorization 
property for R. 

2) This factorization is essentially unique in the sense that if r = 
Pj. . -p n = qi* . .q m are two factorizations into irreducible elements 
then m = n and there is some permutation 7 r for which pj ~ q 
for all i. D 

If r E R is not irreducible, then r = st where s and t are nonunits. 
Evidently, we may continue to factor as long as at least one factor is 
not irreducible. An integral domain R has the factorization property 
precisely when this factoring process always stops after a finite number 
of steps. 

When is an integral domain a unique factorization domain? The 
following answer helps explain the importance of ufd’s. 

Theorem 0.5.1 Let R be an integral domain for which the factorization 
property holds. The following conditions are equivalent and therefore 
imply that R is a unique factorization domain. 

1) Factorization in R is essentially unique. 

2) Every irreducible element of R is prime. 

3) Any two elements of R (not both zero) have a greatest common 
divisor. □ 

Corollary 0.5.2 In a unique factorization domain, the concepts of prime 
and irreducible are equivalent. 0 



0.6 Principal Ideal Domains 

Definition An integral domain R is called a principal ideal domain (pid) 
if every ideal of R is principal. D 

Theorem 0.6.1 Every principal ideal domain is a unique factorization 
domain. 0 

We remark that the ring Z[x] is a ufd (as we prove in Chapter 1) but 
not a pid (the ideal (2,x) is not principal) and so the converse of the 
previous theorem is not true. 
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Theorem 0.6.2 Let R be a principal ideal domain and let 3 be an ideal 
of R. 

1) 3 is maximal if and only if 3 = (p) where p is irreducible. 

2) 3 is prime if and only if 3 = {0} or 3 is maximal. 

3) The following are equivalent: (i) R/(p) is a field (ii) R /(p) is an 
integral domain (iii) p is irreducible (iv) p is prime. 0 

0.7 Euclidean Domains 

Roughly speaking, a Euclidean domain is an integral domain in 
which we can perform “division with remainder.” 

Definition An integral domain R is a Euclidean domain if there is a 
function <r:(R-{0})— with the property that given any a, /? G R, 
/? ^ 0, there exist q, r G R satisfying 

a = q/3 + r 

where r = 0 or err < <r/?. □ 

Theorem 0.7.1 A Euclidean domain is a principal ideal domain (and 
hence also a unique factorization domain). 

Proof. Let 3 be an ideal in the Euclidean domain R and let a G 3 be 
minimal with respect to the value of a. Thus, <ra < <r/3 for all /? G 3. If 
s G 3 then 

s = ra + q 

where q = 0 or <rq < err. But q = s - ra G 3 and so the latter is not 
possible, leaving q = 0 and s G (e*). Hence, 3 = (a). I 

Theorem 0.7.2 If F is a field, then F[x] is a Euclidean domain with 
<r(p(x)) = deg p(x). Hence F[x] is also a principal ideal domain and a 
unique factorization domain. 

Proof. This follows from ordinary division of polynomials; to wit, if 
f(x), g(x) G F[x], g(x) ^ 0, then there exist q(x), r(x) G F[x] such that 

f(x) = q(x)g(x) + r(x) 



where deg r(x) < deg g(x). I 
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0.8 Tensor Products 



Tensor products are used only in the optional Section 5.6. 



Definition Let U, V and W be vector spaces over a field F. A function 
f:U xV— W is bilinear if it is linear in both variables separately, that is, 
if 



f(ru + su',v) = rf(u,v) -f sf(u',v) 

and 

f(u,rv + sv') = rf(u,v) -f sf(u,v') 



The set of all bilinear functions from U x V to W is denoted by 
*35(11, V;W). A bilinear function f:U xV-+F, with values in the base field 
F, is called a bilinear form on U x V. 0 



Example 0.8.1 

1) A real inner product (,):V x V-4? is a bilinear form on V x V. 

2) If A is an algebra, the product map fi : A x A— >A defined by 
/i(a,b) = ab is bilinear. 0 

We will denote the set of all linear transformations from U x V to W 
by JL(U x V,W). There are many definitions of the tensor product. We 
choose a universal definition. 



Theorem 0.8.1 Let U and V be vector spaces over the same field F. 
There exists a unique vector space U 0 V and bilinear map 
<:UxV->U(8)V with the following property. If f:UxV- >W is any 
bilinear function from U x V to a vector space W over F, then there is a 
unique linear transformation r:U 0 V— *W for which 

T O t = f 0 

This theorem says that to each bilinear function f:UxV->W, there 
corresponds a unique linear function r:U 0 V— »W, through which f can 
be factored (that is, f = r o t). The vector space U 0 V, whose existence 
is* guaranteed by the previous theorem, is called the tensor product of U 
and V over F. We denote the image of (u,v) under the map t by 
t( u,v) = u0v. 

If X = Im i = {u 0 v | u E U, v E V} is the image of the tensor map t 
then the uniqueness statement in the theorem implies that X spans 
U 0 V. Hence, every element of a E U 0 V is a finite sum of elements of 
the form u 0 v 

« = £ a i( u i® v i) 
finite 
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We establish a few basic properties of the tensor product. 



Theorem 0.8.2 If {u 1 ,...,u n } C U is linearly independent and 
{vi,...,v n } C V then 

Y Uj ® Vj = 0 =» V; = 0 for all i 



Proof. Consider the dual vectors E U* to the vectors u^, where 6jUj = 
S^y For linear functionals e i :V-+F, we define a bilinear form f:U x V— »F 



by 



f ( u - v ) = 5D 6 j( x ) e j(y) 

j=i 



Since there exists a unique linear functional r:U<8>V— >F for which 
r o t = f, we have 



0 = r (^ Ui ® Vi ) = H ro< ( u i> v i) 

i i 

= E f ( u i- v i) = E E^KW = E f i( v i) 

i i j i 

Since the q’s are arbitrary, we deduce that Vj = 0 for all i. I 



Corollary 0.8.3 If u ^ 0 and v / 0, then u <8> v ^ 0. D 



Theorem 0.8.4 Let *35 = {ej | i E 1} be a basis for U and C = {fj | j E J} be 
a basis for V. Then the set <3) = {ej 0 fj | i E I, j E J} is a basis for U 0 V. 

Proof. To see that the *3) is linearly independent, suppose that 

E r ij( e i® f j) = ° 

This can be written 1J 

E e i®(E r ij f j)=° 

1 j 

Theorem 0.8.2 implies that 

E r ij f j =0 

j 

for all i, and hence r- j = 0 for all i and j. To see that *3) spans U 0 V, let 
u 0 v E U 0 V. Since u = r^, and v = Sjfj, we have 

1 j 

u 0 v = ^ rjej 0 ^2 s jfj = E s j(E r i e i® f j) 

i j j i 
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= ]C s j(5Z r i( e i® f j))= £ r i s j( e i® f j) 

j i ij 

Since any vector in U®V is a finite sum of vectors u(g>v, we deduce 
that spans U <g> V. I 

Corollary 0.8.5 For finite dimensional vector spaces, 

dim { U <8> V) = dim(\J) • dim(V) D 



Exercises 

1. The relation of being associates in an integral domain is an 
equivalence relation. 

2. Prove that the characteristic of an integral domain is either 0 or a 
prime, and that a finite field has prime characteristic. 

3. If char(F) = 0, the prime subfield of F is isomorphic to the 
rational numbers Q. If char(F) = p is prime, the prime field of F 
is isomorphic to Z p . 

4. If F < E show that E and F must have the same characteristic. 

5. Let F be a field of characteristic p. The map <r:F— >F defined by 
a a = a p is a homomorphism. It is called the Frobenius map. 
Show that F « F p = {a p | a E F}. What if F is a finite field? 

6. Consider the polynomial ring F[x 1 ,x 2 ,...] where x- 2 = x^. Show 
that the factorization process need not stop in this ring. 

7. Let R = T[y/-5] = {a -f by^5 | a,b E Z}. Show that this integral 
domain is not a unique factorization domain by showing that 6 E 
R has essentially two different factorizations in R. Show also that 
the irreducible element 2 is not prime. 

8. Let R be a pid. Then an ideal 3 of R is maximal if and only if 3 = 
(p) where p is irreducible. Also, R/(p) is a field if and only if p is 
irreducible. 

9. Prove that (x) and (2,x) are both prime ideals in Z[x] and that (x) 
is properly contained in (2,x). 

10. Describe the divisor chain condition in terms of principal ideals. 




Part 1 

Basic Theory 




Chapter 1 

Polynomials 



In this chapter, we discuss properties of polynomials that will be 
needed in the sequel. Since we assume that the reader is familiar with 
the basic properties of polynomials, some of the present material may 
constitute a review. 



1.1 Polynomials Over a Ring 

We will be concerned in this book mainly with polynomials over a 
field F, but it is useful to make a few remarks about polynomials over a 
ring R as well. Let R[x] denote the ring of polynomials in the single 
variable x over R. If 



p(x) = a 0 -f ajX + h a n x n 

where aj E R and a n ^ 0 then n is called the degree of p(x), 
written deg p(x) and a n is called the leading coefficient of p(x). A 
polynomial is monic if its leading coefficient is 1. The degree of the zero 
polynomial is defined to be -oo. 

If R is a ring, the units of R[x] are the units of R, since no 
polynomial of positive degree can have an inverse in R[x]. 

Definition Let R be a ring. A polynomial p(x) E R[x] is irreducible over 
R if whenever p(x) = f(x)g(x) for f(x), g(x) E R[x], then one of f(x) or 
g(x) is a unit in R[x]. A polynomial that is not irreducible is said to be 
reducible. 0 
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Many important properties that a ring R may possess carry over to 
the ring of polynomials R[x]. For instance, if R is an integral domain, 
then so is R[x] and if R is a unique factorization domain, then so is 
R[x]. Note, however, that the ring Z of integers is a principal ideal 
domain, but Z[x] is not, since the ideal (2,x) is not principal. 
Nonetheless, if F is a field, F[x] is a principal ideal domain (Theorem 
0.7.2). 

1.2 Primitive Polynomials 

We now consider polynomials over a unique factorization domain. 
The reader may wish to take a quick look at Section 0.5. 

Definition Let f(x) £ R[x] where R is a unique factorization domain. 
Any greatest common divisor of the coefficients of f(x) is called a 
content of f(x). A polynomial with content 1 is said to be primitive. We 
will use the notation c(f) to denote a content of f(x). □ 

If a is a content of f(x), then /? is also a content of f(x) if and only if 
P ~ a, that is, (3 = ua, where u is a unit in R. Since 

c(ap(x)) ~ ac(p(x)) 

for all a £ R, it follows that a is a content of f(x) if and only if f(x) = 
ap(x), where p(x) is primitive. 

We can also define the content of a polynomial over R', the field of 
quotients of R. To this end, if p is a prime in R, then any nonzero 
element a £ R' has the form 



a = p r a 0 



where r is an integer and p does not divide the numerator or 
denominator of a 0 . The integer r is called the order of a at p, written 
o p (a). If a = 0, we set o p (a) = oo. It is easy to see that if ab ^ 0 then 

Op(ab) = o p (a) + o p (b) 

If f(x) = J^ajX 1 is a nonzero polynomial in R'[x], we set 

°p( f ) = m j n °p( a i) 

and if f(x) = 0, we set o p (f) = oo. Then a content of f(x) is defined to be 

c ( f ) = u IIp Bp ^ 
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where u is any unit in R and the product is taken over all primes p for 
which o (f) ^ 0. Thus, content in R' is unique up to multiplication by a 
unit in R. 

For any a E R', we have c(c*p(x)) = uoc(p(x)) where u is a unit in R 
and so a is a content of f(x) E R'[x] if and only if 

f(x) = ap(x) 

where p(x) is a primitive polynomial (and hence in R[x]). It follows that 
f(x) E R[x] if and only if its content is in R. 

We now come to a key result concerning primitive polynomials. 

Theorem 1.2.1 Let R be a unique factorization domain and let R' be 
the field of quotients of R. 

1) (Gauss’ Lemma) If f(x) and g(x) are primitive in R[x] then so is 
f (x)g(x). 

2) If f(x), g(x) E R'[x] then c(fg) = uc(f)c(g), where u is a unit in R. 

3) Let f(x), g(x) E R[x], with g(x) primitive. If f(x) = g(x)h(x), where 
h(x) E R'(x) then, in fact, h(x) E R[x]. 

Proof. To prove Gauss’ Lemma, suppose that fg is not primitive. Then 
there exists an irreducible element r E R for which r | fg. Since R is a 
unique factorization domain, r is also prime. Hence (r) is a prime ideal 
and R[x]/(r) is an integral domain. Since r | fg, we have fg E (r) and so 

(f + (r))(g + (r)) = fg + (r) = (r) 

whence f -f (r) = (r) or g -f (r) = (r), that is, r | f or r | g. Hence, one of f 
or g is not primitive. 

To prove part 2), observe that if c f is a content of f(x) and c g is a 
content of g(x) then f = c f f and g = c g g', where f and g' are primitive. 
Hence, by Gauss’ Lemma 

c(fg) = c(c f c g fg') ~ C f c g c(f'g') = c f c g 

As to part 3), we have 

c(f) ~ c(g)c(h) ~ c(h) 

and since c(f) 6 R, so is c(h), whence h(x) G R[x]. I 

The previous theorem can be used to relate the irreducibility of a 
polynomial over a unique factorization domain R to its irreducibility 
over the field of quotients R' of R. The next theorem says in loose 




28 



1 Polynomials 



terms that the only difference between irreducibility over R and over R' 
is how constant factors are treated. 

Theorem 1.2.2 Let R be a unique factorization domain, with field of 
quotients R'. 

1) A primitive polynomial p(x) E R[x] is irreducible over R if and 
only if it is irreducible over R'. 

2) A polynomial f(x) E R[x] is irreducible over R if and only if it is 
either an irreducible element of R or a primitive polynomial that 
is also irreducible over R'. 

Proof. To prove part 1), observe that a primitive polynomial p(x) has 
no constant nonunit factors and so p(x) is irreducible over R if and only 
if it can be written as a product of nonconstant factors over R. Hence, if 
p(x) is reducible over R, it is also reducible over R'. On the other hand, 
if p(x) is reducible over R', then it has the form p(x) = f(x)g(x), where 
f(x) and g(x) are nonconstant polynomials in R'[x]. Now we may write 

p(x) = f'(x)[c(f)g(x)] 

where f'(x) is primitive and hence, by Theorem 1.2.1, c(f)g(x) is a 
polynomial over R. Thus f(x) is reducible over R as well. 

To prove part 2), note that if f(x) is a constant, then there is nothing 
to prove, since the constant nonunits in R[x] are precisely the nonunits 
in R. On the other hand, if f(x) has positive degree, then it is 
irreducible over R if and only if it is both primitive and irreducible over 
R and this is equivalent, by part 1), to being primitive and irreducible 
over R'. I 



1.3 The Division Algorithm 

The familiar division algorithm for polynomials over a field F can be 
easily extended to polynomials over a commutative ring with identity, 
provided that we divide only by polynomials whose leading coefficient is 
a unit. We leave proof of the following to the reader. 

Theorem 1.3.1 (Division algorithm) Let R be a commutative ring with 
identity. Let g(x) E R[x] have invertible leading coefficient. Then for 
any f(x) E R[x], there exist unique q(x), r(x) E R[x] such that 

f(x) = q(x)g(x) + r(x) 



where deg r(x) < deg g(x). 0 
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This theorem has some very important immediate consequences. 

Corollary 1.3.2 Let R be a commutative ring with identity and let 
f(x) G R[x]. Then a is a root of f(x) if and only if x — a is a factor of 
f(x) over R. 0 

Since the usual degree formula deg f(x)g(x) = deg f(x) + deg g(x) 
holds when R is an integral domain, we have the following. 

Corollary 1.3.3 If R is an integral domain then a nonzero polynomial 
f(x) G R[x] can have at most deg f(x) roots in R. □ 

In the customary way, a polynomial p(x) G R[x] can be thought of as 
a function on R. If R is an integral domain, Corollary 1.3.3 insures that 
if p(r) = 0 for an infinite number of distinct values of r G R then p(x) 
must be the zero polynomial. Thus, if R is infinite , then p(x) is zero as 
a function if and only if it is zero as a polynomial. Note that this does 
not hold for finite fields, for instance, the nonzero polynomial p(x) = 
x 2 — x is the zero function on Z 2 . This result can be extended to 
polynomials in more than one variable by induction and we leave the 
details to the reader. 

A polynomial in more than one variable may have infinitely many 
zeros, however, and yet not be the zero polynomial. For instance 
p(x,y) = x — y has infinitely many zeros over R. This example 
notwithstanding, we do have the following useful result, which says 
informally that if a polynomial has a whole subfield worth of zeros, 
then it must be the zero polynomial. 



Theorem 1.3.4 Let F be an infinite field and let L be an extension of F. 
Suppose that q(x 1 ,...,x n ) is a polynomial over L. If q(a 1 ,...,a n ) = 0 for 
all a| G F then q(x 1 ,...,x n ) is the zero polynomial. 

Proof. Write 

q(x 1 ,...,x n ) = £V e 
e 

where x e = x^’-oc^ 11 and A e G L. Let {/?J be a basis for L as a vector 
space over F. Then 

for a • G F and so 1 

e,i 



q(Xi,...,X n )= X) A e* e = EE a e/i X '= X^/X^i*') 



If bj G F, we have 
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0 = q(b 1 ,...,b n )=^/? i (^a e .b e ) 
i e 

and the independence of the /?j’s implies that 

E a e,i be = ° 

e 

for all i. Since this holds for all bj E F, the polynomial a e jx e over F 
must be the zero polynomial. It follows that a e j = 0 for all e and i, and 
so A e = 0 for all e, whence q(x 1? . . . jXjJ = 0. I 

Corollary 1.3.3 can be used to prove a fundamental fact concerning 
finite fields. 

Corollary 1.3.5 Let F be a finite field. The multiplicative group F* of 
all nonzero elements of F is cyclic. 

Proof. Let | F* | = q — 1 and let a have maximum order m < q — 1 
among all the elements in F*. Since F* is a finite abelian group, 
Theorem 0.2.2 implies that a m = 1 for all a E F*. Thus, every element 
of F* is a root of the polynomial x m — 1, which has at most m roots. 
Hence m = q — 1, and F* is cyclic. I 

In defining the greatest common divisor of two polynomials, it is 
customary (in order to obtain uniqueness) to require that it be monic. 

Definition Let f(x) and g(x) be polynomials over F. The greatest 
common divisor of f(x) and g(x), denoted by (f(x),g(x)) or 
gcd(f(x),g(x)), is the unique monic polynomial p(x) over F for which 

1) p(x) I f(x) and p(x) | g(x). 

2) If r(x) E F[x] and r(x) | f(x) and r(x) | g(x) then r(x) | p(x). 0 

The existence of greatest common divisors and the fact that d(x) = 
gcd(f(x),g(x)) is independent of the field F, that is, d(x) lies in any field 
K containing the coefficients of f(x) and g(x), follow from the fact that 
F[x] is a principal ideal domain. In particular, the ideal I = (f(x),g(x)) 
of K[x] is principal and so 1 = (p(x)) where p(x) E K[x], Since f(x) E 
(p(x)), we have p(x) | f(x) and similarly p(x) | g(x) over K and hence 
over any larger field F. Since p(x) E (f(x),g(x)), there exist a(x), b(x) E 
K[x] such that p(x) = a(x)f(x) + b(x)g(x). Hence, if q(x) | f(x) and 
q(x) | g(x) over F then q(x) | p(x) over F. Thus, p(x) = gcd(f(x),g(x)). 

Theorem 1.3.6 Let f(x), g(x) E F[x] and let K be the smallest subfield of 
F containing the coefficients of f(x) and g(x). Then there exist a(x), 
b(x) E K[x] such that gcd(f(x),g(x)) = a(x)f(x) + b(x)g(x) E K[x]. 0 
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Theorem 1.3.7 Let f(x), g(x) £ F[x] and let F < E. Then f(x) and g(x) 
have a nonconstant common factor over F if and only if they have a 
nonconstant common factor over E. 

Proof. Any common divisor h(x) of f(x) and g(x) over E is also a 
divisor of a(x)f(x) + b(x)g(x) = gcd(f(x),g(x)). Hence, if h(x) is 
nonconstant, so is gcd(f(x),g(x)). I 

Definition The polynomials f(x), g(x) £ F[x] are relatively prime if 
gcd(f(x),g(x)) = 1. In particular, f(x) and g(x) are relatively prime if 
and only if there exist polynomials a(x), b(x) £ F[x] for which 

a( x )f(x) + b(x)g(x) = 1 D 

Corollary 1.3.8 The polynomials f(x), g(x) £ F[x] are relatively prime if 
and only if they have no common roots in any extension field E of F. 

Proof. If gcd(f(x),g(x)) = 1 then a(x)f(x) + b(x)g(x) = 1 implies that 
f(x) and g(x) have no common roots in any extension. Conversely, if 
gcd(f(x),g(x)) is nonconstant, any of its roots is a common root of f(x) 
and g(x) in some extension. I 

Corollary 1.3.9 If f(x) and g(x) are distinct monic irreducible 
polynomials over F then they have no common roots in any extension E 
of F. D 



1.4 Splitting Fields 

It is a fundamental fact that every nonconstant polynomial 
f(x) £ F[x] has a root in some field. 

Theorem 1.4.1 Let F be a field, and let f(x) £ F[x] be a nonconstant 
polynomial. Then there exists an extension E of F and an a £ E such 
that f(a) = 0. 

Proof. We may assume that f(x) is irreducible. Consider the field E = 
F[x]/(f(x)). The field F may be thought of as a subfield of E, by 
identifying a £ F with a-f-(f(x)) £ E. Then x + (f(x)) is a root of f(x) in 
E. (We have actually shown that F can be embedded in a field in which 
f(x) has a root, but this is sufficient in view of Exercise 17 of Chapter 
2 .)| 

Repeated application of Theorem 1.4.1 gives the following corollary. 
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Corollary 1.4.2 Let f(x) E F[x]. There exists an extension E of F such 
that f(x) factors into linear factors over E. □ 

If a polynomial f(x) E F[x] factors into linear factors 

f(x) = a(x - ai )(x - a 2 )- • -(x - aj 

over an extension field E (that is, if a, a n E E), we say that f(x) 

splits in E. 

Definition Let ®F = {fj(x)} be family of polynomials over a field F. A 
splitting field for % is an extension field E of F with the property that 
each f|(x) in splits in E and that E is generated over F by the roots of 
the polynomials in 0 

Corollary 1.4.3 Every finite family of polynomials over a field F has a 
splitting field. 

Proof. Corollary 1.4.2 implies that any single polynomial has a splitting 
field. If is a finite family of polynomials, then a splitting field for is 
a splitting field for the product of the polynomials in 7. I 

We will see in the next chapter that any family of polynomials has a 
splitting field. We will also see that any two splitting fields S l and S 2 
for a family of polynomials over F are isomorphic by an isomorphism 
that fixes each element of the base field F. 



1.5 The Minimal Polynomial 

Let F < E. An element a E E is said to be algebraic over F if there is 
some nonzero polynomial f(x) E F[x] for which f(a) = 0. An element 
that is not algebraic over F is said to be transcendental over F. 

If a is algebraic over F, the set 

3 = {g(x) € F[x] I g(a) = 0} 

is a nonzero ideal in F[x] and is therefore generated by a unique monic 
polynomial p(x), called the minimal polynomial of a over F and 
denoted by min(a,F). The following theorem characterizes minimal 
polynomials in a variety of useful ways. Proof is left to the reader. 

Theorem 1.5.1 Let F < E and let p(x) = min(a,F) where a E E. Then 
among all polynomials in F[x], the polynomial p(x) is 
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1) the unique monic irreducible polynomial for which p(a) = 0 

2) the unique monic polynomial of smallest degree for which 
p(«) = 0 

3) the unique monic polynomial with the property that f(c*) = 0 if 
and only if p(x) | f(x). 0 

Definition Let F < E. Then a, /? G E are said to be conjugates over F if 
they have the same minimal polynomial over F. 0 



1.6 Multiple Roots 

Definition Let a be a root of f(x) G F[x]. The multiplicity of a is the 
largest positive integer n for which (x — a) n divides f(x). If n = 1, we 
say that a is a simple root and if n > 1, we say that o is a multiple root 
of f(x). □ 

Definition An irreducible polynomial f(x) G F[x] is said to be separable 
if it has no multiple roots in any extension of F. An irreducible 
polynomial that is not separable is inseparable. 0 

Although, as we now show, all irreducible polynomials over a field of 
characteristic zero or a finite field are separable, the concept of 
separability plays a key role in the theory of more “unusual” fields. 

Theorem 1.6.1 A polynomial f(x) has no multiple roots if and only if 
f(x) and its derivative f'(x) are relatively prime. 

Proof. Over a splitting field E for f(x), we have 

f(x) = (x-a 1 ) e l---(x-a n ) e n 

where the cq’s are distinct. It is easy to see that f(x) and f (x) have no 
nontrivial common factors over E if and only if ej = 1 for all i. Thus, 
f(x) has no multiple roots in E if and only if f(x) and f (x) are relatively 
prime. I 

Corollary 1.6.2 An irreducible polynomial f(x) is separable if and only if 
f(x)^0. 

Proof. Since deg f (x) < deg f(x) and f(x) is irreducible, we deduce that 
f(x) and f (x) are relatively prime if and only if f (x) ^ 0. I 

If char(F) = 0 then f (x) ^ 0 for any nonconstant f(x). Thus, we get 
the following corollary. 
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Corollary 1.6.3 All irreducible polynomials over a field of characteristic 
0 are separable. 0 

For char(F) = p ^ 0, the next result says that the inseparable 
polynomials are precisely the polynomials of the form g(x^) for some 
d> 1. 

Corollary 1.6.4 Let char(F) = p ^ 0 and let f(x) € F[x] be irreducible. 

1) If f(x) is inseparable, then there exists a positive integer d such 
that f(x) = q(x pd ), where q(x) is separable. In this case, all roots of 
f(x) have multiplicity p d . 

2) If f(x) = h(x pd ) where h(x) is any nonconstant polynomial and d is 
a positive integer, then f(x) is inseparable. 

Proof. For the first statement in part 1), suppose that f(x) = J^x 1 has 
a multiple root in some extension E of F. Then f (x) = 0 which implies 
that iaj = 0 for all i, which in turn implies that p | i for all i such that 
aj 0. Hence, f(x) = q(x p ). If q(x) has no multiple roots, we are done. 
If not, then we may repeat the argument with the irreducible 
polynomial q(x), eventually obtaining the desired result. 

For part 2), if h(x) is not separable, then by part 1), we have h(x) = 
q(x p ) where q(x) is separable and so 

f(x) = h(x pd ) = q(x pd+k ) 

Thus, we may suppose that h(x) is separable. Let K be a field in which 
both f(x) and h(x) split. Over K, we have h(x) = (x — a^)* • *(x — a k ) and 
so 

f(x) = (x pf '-a 1 >--(x^-Q' k ) 

where the a x E K are distinct. Since f(x) splits in K, there exist roots 
/?j € K for each of the factors x pd — c^, and so a x = Hence, 

f(x) = (x pd -/?f)-(x^-^) 

Since char(F) = p, 

f(x) = (x - • -(x - /? k / 

which shows that all the roots of f(x) have multiplicity p d . This proves 
part 2) and also the second statement in part 1). I 

Corollary 1.6.5 All irreducible polynomials over a finite field are 
separable. 
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Proof. Let char(F) = p. The field F is an extension of its prime subfield 
Z p and if the dimension of F as a vector space over Z p is n, then F has 
q = p n elements. Hence, the multiplicative group F* of nonzero 
elements of F has order q — 1 and so a q = a for all a E F. In particular, 
any element of F is a p-th power of some other element of F. Thus, any 
polynomial of the form q(x p ) satisfies 



q(x p ) = a 0 + ajX p + h a R x pn 

= bg +b?x p + ... + b p x np 
= (b 0 + bjX H h b n x n ) p 

and so is not irreducible. I 

We should note that in infinite fields of nonzero characteristic, there 
are irreducible polynomials with multiple roots. 

Example 1.6.1 Let F be a field of characteristic 2 and consider the field 
F(y) of all rational functions in the variable y. The polynomial f(x) = 
x 2 — y 2 is irreducible over the subfield F(y 2 ), since it has no linear 
factors over F(y 2 ). However, in F(y) we have f(x) = (x — y) 2 and so y is 
a double root of f(x). 0 



1.7 Testing for Irreducibility 

We discuss two well-known methods for testing a polynomial for 
irreducibility. 

Theorem 1.7.1 (Eisenstein’s criterion) Let R be an integral domain and 

let p(x) = a 0 + a 2 x H b a R x n E R[x] have relatively prime coefficients. 

If there exists a prime p E R satisfying 

p | a 5 for 0 < i < n, p/a n , p 2 /a 0 

then p(x) is irreducible. 

Proof. Suppose that p(x) = f(x)g(x) where neither factor is a unit. If 
f(x) = f 0 E R then f 0 divides aj for all i, implying that f(x) = f 0 is a 
unit, which is not the case. Thus, deg f(x) > 0 and similarly 
deg g(x) > 0. Let 



f(x) =f 0 + f iX + --- + f k xk and g(x) = g 0 + g x x + • • • + g m x m 



Since a 0 = f 0 g 0 and p | a 0 , p 2 / a 0 we may assume that p | f 0 and p/g 0 - 
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Let 0 < i < n be the smallest integer for which p/fj and consider the 
coefficient 

a i = f ogi + f ig-,-i + "- + f igo 

We have p | a ; , p | f 0 gj, . . . , fj_ 1 g 1 but p/f;g 0 , a contradiction. Hence p(x) 
is irreducible. I 

Eisenstein’s criterion can be useful as a theoretical tool. 

Corollary 1.7.2 For every positive integer n, there is an irreducible 
polynomial p n (x) of degree n over the integers. 0 

A useful approach to testing for irreducibility over Z[x], and hence 
also over Q[x], is localization . For a prime p, let <r:Z— »Z p be the natural 
map 

an = n = n + (p) 

If p(x) E Z[x] we denote (<rp)(x) by p(x). 

Theorem 1.7.3 Let p(x) = a 0 -f a x x H h a n x n E Z[x] be primitive. Let 

p be a prime that does not divide a n . If p(x) is irreducible over Z p then 
p(x) is irreducible over Z. 

Proof. Assume that p(x) is irreducible over Z but that p(x) = f(x)g(x) 
is the product of nonunits over Z. Then p(x) = f(x)g(x). Since a n ^ 0 
mod p, we have 

deg f(x) + deg g(x) = deg p(x) = deg p(x) = deg f(x) + deg g(x) 

which implies that deg f(x) = deg f(x) and deg g(x) = deg g(x). Since 
p(x) is irreducible, we must have deg f(x) = 0 or deg g(x) = 0, implying 
that one of f(x) or g(x) is a constant (nonunit), in contradiction to the 
primitiveness of p(x). Hence, p(x) is irreducible over Z. I 



Exercises 

1. Prove that if R is an integral domain then so is R[x 1 ,...,x n ]. 

2. Describe the units in F[x] where F is a field. 

3. Let R be an integral domain. Prove that c(ap(x)) ~ ac(p(x)) for 
any p(x) E R[x] and a E R. 

4. Prove that if n > 1 then the ring F[x 1 ,...,xJ is not a principal 
ideal domain. 

5. If f(x) E R[x] where R is an integral domain with field of quotients 
R', then f(x) can also be viewed as a polynomial in R'[x]. Show 
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that the definition of content for f(x) E R[x] agrees with the 
definition of content for f(x) E R'[x]. 

6. Verify the division algorithm (Theorem 1.3.1) for commutative 
rings with identity. Hint . try induction on deg f(x). 

7. Show that the condition that p(x) be primitive is essential in the 
first part of Theorem 1.2.2. 

8. Prove Theorem 1.5.1. 

9. Let deg p(x) = d. The reciprocal polynomial is q(x) = x d p(x"" 1 ). 
Are the irreducibility of p(x) and q(x) related? Can you deduce an 
alternate version of Eisenstein’s criterion from this? 

10. Show that if p is a prime in an integral domain R, the polynomial 
p(x) = x n — p is irreducible. 

11. Prove that for every positive integer n there is an irreducible 
polynomial p n (x) E Z[x] of degree n. 

12. For p prime show that p(x) = 1 -f x + x 2 H bx p ~‘ 1 is irreducible 

over Z[x]. Hint apply Eisenstein to the polynomial p(x+l). 

13. Use the idea of localization (apply the map cr) to deduce that 
Eisenstein’s criterion implies irreducibility in Z[x], 

14. Prove that for p prime, x n + px + p 2 is irreducible over Z[x]. 

15. If R is an infinite integral domain and p(x 1 ,...,x n ) is a 
polynomial in several variables over R, show that p(x 1 ,...,x n ) is 
zero as a function if and only if it is zero as a polynomial. 

If f(x) is a polynomial of degree d, we define the reciprocal polynomial 

by f R (x) = x^x"” 1 ). Thus, if 

f(x) = a n x n + a n _ 1 x n “ 1 + b ajx -f a 0 

then 

f R (x) = a 0 x n + a lX n-1 + • • • + a n _ lX + a n 

If a polynomial satisfies f(x) = f R (x), we say that f(x) is self-reciprocal. 

16. Show that a ^ 0 is a root of f(x) if and only if a” 1 is a root of 
f R(*)- 

17. Show that the reciprocal of an irreducible polynomial f(x) / x is 
also irreducible. 

18. Show that if a polynomial f(x) is self-reciprocal and irreducible, 
then deg f(x) must be even. 

19. Suppose that f(x) = p(x)q(x), where p(x) and q(x) are irreducible, 
and f(x) is self-reciprocal. Show that either 

(i) p(x) = <5p R (x) and q(x) = Sq^Ax) with 6 = ± 1, or 

(ii) p(x) = aq R (x) and q(x) = a -1 p R (x) for some a E GF(q). 

What can you say about this if deg p(x) is odd? 

20. There is a simple (but not necessarily practical) algorithm for 
factoring any polynomial over Q, due to Kronecker. In view of 
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1 Polynomials 



Theorem 1.2.2, it suffices to consider polynomials with integer 
coefficients. Prove that a polynomial of degree n is completely 
determined by specifying n + 1 of its values. Hint Use the 
Lagrange Interpolation Formula 



p( x ) = 2*1 p(0 

i=0 



m 

J 



x-J 



Let f(x) be a polynomial of degree n > 1 over Z. If f(x) has a 
nonconstant factor p(x) of degree at most n/2, what can you say 
about the values p(i) for i = 0,...,[n/2j? Construct an algorithm 
for factoring f(x) into irreducible factors. 




Chapter 2 

Field Extensions 



Field extensions F < E can be characterized in a variety of useful 
ways. Some characterizations involve properties of the individual 
elements of the extension. For instance, an extension F < E is algebraic 
if each element a E E is algebraic over F. Other characterizations 
involve the field E as a whole. For instance, F < E is normal if E is the 
splitting field for a family of polynomials over F. In this chapter, we 
will describe several types of extensions and study their basic properties. 



2.1 The Lattice of Subfields of a Field 

If E is an extension field of F, then E can be viewed as a vector space 
over F. The dimension of E over F is denoted by [E:F] and called the 
degree of E over F. A sequence of fields E 1 ,...,E n for which Ej < E i+1 
is referred to as a tower of fields, and we write E x < E 2 < • • • < E n . The 
fact that dimension is multiplicative over towers is fundamental. 

Theorem 2.1.1 Let F < K < E. Then 

[E:F] = [E:K][K:F] 

Moreover, if A = {cq | i € 1} is a basis for E over K and B = {/?• 1 j*E J} 
is a basis for K over F, then the set C = | i € I, j € J} is a basis for 

E over F. 
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Proof. For the independence of C, if = 0 then £^aj jcq = 0 for 

all j, and the latter implies that ajj = 0 for all i, j. Hence, C is 
independent. Next, if 7 G E then there exist aj G K such that 7 = 
5]^. Since each a^ is a linear combination of the /?j’s, it follows that 
7 is a linear combination of the products cq/?j. Hence C is a basis for E 
over F. I 

If F and E are subfields of a field K, then the intersection F fl E is 
clearly a field. The composite FE of F and E is defined to be the 
smallest subfield of K containing both F and E. The composite FE is 
also equal to the intersection of all subfields of K containing E and F. 
More generally, the composite V Ej of a family 8 = {Ej | i G 1} of fields, 
all of which are contained in a single field E, is the smallest subfield of 
E containing all members of the family. Note that the composite of 
fields is defined only when the fields are all contained in one larger field. 
Whenever we form a composite, it is with the tacit understanding that 
the relevant fields are so contained. 

A monomial over a family 8 = {Ej | i G 1} of fields with Ej < E is an 
element of E of the form 

e ; e ; • • *e ; , where e ; G E ; 

*1 l 2 V *k >k 

Note that the set of all finite sums of monomials over 8 is the smallest 
subring R of E containing each field Ej and the set of all quotients of 
elements of R (the quotient field of R) is the composite V Ej. Thus, 
each element of V Ej involves only a finite number of elements from the 
union (J Ej and is therefore contained in a composite of a finite number 
of fields from the family 8. 

The collection of all subfields of a field K forms a complete lattice L 
(under set inclusion), with meet being intersection and join being 
composite. The zero element in L is the prime subfield of K and the 
unit element is K itself. 



2.2 Distinguished Extensions 

Following Lang, we will say that a class C of field extensions is 
distinguished provided that 

Dl) If F < K < E, then (F < E) G C if and only if (F < K) G C and 
(K < E) G C. 

D2) If (F < E) G C and F < K and EK is defined, then (K < EK) G C. 
Note that if C is distinguished, then 
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D3) If (F < E) E C and (F < K) E C and EK is defined, then 
(F < EK) E C. In other words, C is closed under taking (a finite 
number of) composites. 

Figure 2.2.1 illustrates Dl) and D2). We refer to K < EK as the 
lifting of the extension F < E by K. 

E EK 




Figure 2.2.1 

If a class C of extensions has the property that whenever (F < Ej) E C 
for each member of a family {EJ of fields and if V Ej is defined, then 
(F < V Ej) E C, we say that C is closed under the taking of arbitrary 
composites. 



2.3 Finitely Generated Extensions 

If S is a subset of a field E and if F < E, we denote the smallest 
subfield of E containing F and S by F(S). When S = {a 1 ,...,a n } is a 
finite set, it is customary to write F(a 1 ,...,a n ) for F(S). Note that for 
1 < k < n — 1, 



F Kv,aJ = [F(a 1 ,...,a k )](a k+ i,...,a n ) 

Definition Any field of the form E = F(a 1 ,...,a n ) is said to be finitely 
generated over F. We also say that the extension F < E is finitely 
generated. Any extension of the form F < F(a) is called a simple 
extension and a is a primitive element in F(a). 0 

The reader may have encountered a different meaning of the term 
primitive in connection with elements of a finite field. We will discuss 
this alternate meaning when we discuss finite fields later in the book. 

It is evident that F(a 1 ,...,a n ) consists of all quotients of 
polynomials in the cq’s: 
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Theorem 2,3.1 The class of all finitely generated extensions is 
distinguished. 

Proof. For Dl, let F < K < E. If E = K(S) and K = F(T) where S and 
T are finite, then E = F(SUT) is finitely generated over F. Clearly, if 
F < E is finitely generated then K < E is also finitely generated by the 
same set of generators. However, the proof that F < K is finitely 
generated must be postponed until we have discussed transcendental 
extensions in the next chapter. Statement D2 follows from the fact that 
if E = F(S), S finite, and F < K then 

KE = K(F(S)) = K(S) 
and so KE is finitely generated over K. I 



2.4 Simple Extensions 



Since F[x] is a principal ideal domain, the ideal (p(x)) generated by 
p(x) G F[x] is maximal, and the quotient ring 



F[x 

- (p( x )> 



is a field, if and only if p(x) is irreducible. We can use this observation 
to characterize simple algebraic extensions. 



Theorem 2.4.1 Let F < E and let a G E be algebraic over F. Then F(a) 
is isomorphic to the field 

_F[x]_ 

(min(a,F)) 

Proof. Let ^:F[x]— >E be the evaluation (ring) homomorphism defined 
by ^(fW) = f(<*)- The kernel of ^ is the ideal (min(a,F)), and so K is 
isomorphic to ^(F[x]), which implies that V>(F[x]) is a field. Thus, we 
need only show that ^(F[x]) = F(a). Clearly, ^(Ffc]) C F(a). But 
a = V>(x) G V>(F[x]) and F C ^(F[x]) imply that F(a) C ^>(F[x]). Hence, 
tf(F[x]) = F(a). I 



Let p(x) be irreducible over F. Since addition and multiplication in 
K = F[x]/(p(x)) is done using coset representatives and since 



K' = {f(x) G F[x] | deg f(x) < deg p(x)} 

is a complete set of distinct coset representatives for K, we may identify 
K with K', where addition and multiplication are performed modulo 
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p(x). This allows us the customary practice of thinking of F as a 
subfield of K. Note also that, as a vector space over F, we have 
dim K = degp(x). In the symbolism of Theorem 2.4.1, we have 
[F(a):F] = deg mm(a,F). 

Thus F(o) is the set of all polynomials in a of degree less than d = 
deg m*n(c*,F), with addition and multiplication modulo min(a, F). It 
follows that the set {ljO,...,^” 1 } is a basis for F(a) over F. 

As for simple transcendental extensions, we have the following. 

Theorem 2.4.2 Let F < E and let a E E be transcendental over F. Then 
F(a) is isomorphic to the field of all rational functions F(x) in a single 
variable x. 

Proof. The evaluation homomorphism \p:F(x)-+E is injective, for if 
f(a)/g(c*) = 0 then f(a) = 0, which implies that f(x) = 0, since otherwise 
a would be algebraic. Since ^(F(x)) = F(a), we deduce that rp is an 
isomorphism from F(x) onto F(a). I 



2.5 Finite Extensions 

If F < E and [E:F] is finite, we say that E is a finite extension of F or 
that F < E is finite. We have already seen that the following is true. 

Theorem 2.5.1 If F < E and if a £ E is algebraic over F then F < F(a) 
is finite, and [F(a):F] = deg min(a,F). 0 

Theorem 2.5.2 An extension is finite if and only if it is finitely 
generated by algebraic elements. 

Proof. If F < E is finite and if {a 1 ,...,o n } is a basis for E over F, then 
E = F(a 1 ,...,a n ) is finitely generated over F. Moreover, for each k, the 
infinite set of nonnegative powers of a k cannot be linearly independent 
over F, it follows that a k must be algebraic over F. 

For the converse, assume that E = F(o 1 ,. . . ,a n ), where each aj is 
algebraic over F, and consider the tower 

F < F(aj) < F(a 1 ,a 2 ) < • • • < F(a 1( . . . , a n ) = E 

Since is algebraic over F(aj,. . . ,<*;_! ), each extension in the tower is 
finite, and so E is finite over F by Theorem 2.1.1. 1 

Suppose that E = F(o 1 ,...,a n ) is finitely generated by algebraic 
elements over F and consider the tower 
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F < F(a t ) < F(aj,a 2 ) < ••• < F(a 1 ,...,a n ) = E 

Our results on simple algebraic extensions show that any element of 
F(a 1 ) is a polynomial in over F. Further, any element of F(a 1 ,a 2 ) is 
a polynomial in a 2 over F(a 1 ), and hence a polynomial in the two 
variables a x and a 2 . Continuing in this way, we conclude that E is the 
set of all polynomials over F in or-^, . . - , « n . 

Theorem 2.5.3 The class of finite extensions is distinguished. 

Proof. The multiplicativity of degree shows that D1 holds. For D2, let 
F < E be finite, with basis {a v ...,a n } and let F < K. Thus E = 
F(a 1 ,. ..,a n ) where each ctj is algebraic over F and so also over K. Since 
EK = K(a 1 ,. ..,a n ) is finitely generated by elements algebraic over K, it 
is a finite extension of K. I 

Note that if E is a splitting field for p(x) E F[x] then E is generated 
by a complete set of distinct roots a v ...,a of p(x). Thus E = 
F(a 1 ,...,a n ) is finitely generated by algebraic elements and so is a 
finite extension of F, of degree at most d!, where d = deg p(x). This also 
applies to the splitting field for any finite set of polynomials over F. 

Suppose that F < E is finite and let B = {/?j,. . . , /? n } be a basis for E 
over F. If F < K, then since EK = K(/? 1 ,...,/? n ) and each /? • is algebraic 
over F, and hence also over K, it follows that EK is the set of 
polynomials over K in /? 1? — , /? n . However, any monomial in the /?.’s is 
a linear combination (over F) of /?i,...»/? n and so EK is the set of 
linear combinations of /3 v ...,(3 n over K. In other words, B spans EK 
over K. We have proved the following, which says that a lifting cannot 
increase degree. 

Theorem 2.5.4 If B is a basis for E over F and if F < K then B spans 
EK over K. In particular, if F < E is finite then [EK:K] < [E:F]. Q 

The next theorem characterizes finite simple extensions. 

Theorem 2.5.5 A finite extension F < E has the form E = F(a) for 
a E E if and only if there are only a finite number of intermediate fields 
F < K < E between E and F. 

Proof. Suppose first that E = F(a), and that p(x) = rain(a,F). Define a 
map ^ that assigns to each intermediate field K the polynomial ip(K) = 
min(a,K). Since p(x) E K[x] and p(a) = 0, we have ^(K) | p(x). But a 
monic polynomial has only a finite number of monic divisors. Hence, 
the range of ^ is finite and therefore it is sufficient to show that is 
injective. Let K be an intermediate field, let S be the set of coefficients 
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of rp(K) and consider the tower F(S) < K < F(a). Since ^>(K) is a monic 
irreducible polynomial over F(S) and is satisfied by a, we have ip(K) = 
min(a,F(S)). Hence, [F(a):K] = deg ip(K) = [F(a):F(S)], which implies 
that [K:F(S)] = 1, that is, K = F(S). This shows that K is uniquely 
determined by the polynomial V’(K), and so tj; is injective. 

For the converse, if E is a finite field, the multiplicative group E* of 
nonzero elements of E is cyclic. If a generates this group, then E = F(a) 
is simple. Now suppose that E is an infinite field and there are only 
finitely many intermediate fields between E and F. Let a, /3 E E and 
consider the intermediate fields F(c*4-a/3), for a E F. By hypothesis, 
F(a + a (3) = F(c* 4- b /3) for some a ^ b E F. Hence, a 4- b/3 E F(a 4- a/?), 
implying that 

0 = ^[(« + a/3) - (a + b/3)] e F(a + a/?) 

and 

a = (a*f a /3) — a/3 E F(a 4* a/3) 

Hence, F(a,/3) C F(a 4- a/3). The reverse inclusion is evident and so 
F(a,/3) = F(a4-a/3), showing that any extension of F generated by two 
elements is a simple extension. Since F < E is finite, it is finitely 
generated and an inductive argument can be used to show that F < E is 
simple. | 

2.6 Algebraic Extensions 

Definition An extension E of F is algebraic over F (or F < E is 
algebraic) if every element a E E is algebraic over F. Otherwise, E is a 
transcendental extension of F. □ 

Theorem 2.6.1 A finite extension is algebraic. 

Proof. If F < E is finite and a E E then the sequence of powers 1, a, 
a 2 ,... cannot be linearly independent over F and therefore some 
nontrivial polynomial in a must equal 0, implying that a is algebraic 
over F. I 

Corollary 2.6.2 Any extension that is finitely generated by algebraic 
elements is algebraic. □ 

Theorem 2.6.3 Let F < E. The set K of all elements of E that are 
algebraic over F is a field, called the algebraic closure of F in E. 

Proof. Let a, /3 eK. The field F(a,/3) is finitely generated over F by 
algebraic elements and so is algebraic over F, that is, F(a,/3) C K. This 
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implies that a 1 , a±/? and a (3 all lie in K, and so K is a subfield of 

E. I 

Theorem 2.6.4 The class of algebraic extensions is distinguished. It is 
also closed under the taking of arbitrary composites. 

Proof. For Dl, let F < K < E. It is clear that if F < E is algebraic then 
so is F < K. Also, since any polynomial over F is a polynomial over K, 
K < E is also algebraic. Conversely, suppose that F < K and K < E are 
algebraic and let a E E have minimal polynomial p(x) = ape 1 over K. 
Consider the tower of fields 

F < F(a 1 ,...,a n ) < F(a 1 ,...,a n ,a) 

Since a is algebraic over F(a 1 , . . . ,a n ) and each a i? being in K, is 
algebraic over F, we deduce that each step in the tower is finite and so 
F < F(a 1? . . .,a n ,c*) is finite. Hence, a is algebraic over F. 

For D2, let F < E be algebraic and let F < K, with E and K 
contained in a field L. We must show that K < EK is algebraic. Let A 
be the algebraic closure of K in EK. Certainly K < A < EK. Since each 
element of E is algebraic over F it is a fortiori algebraic over K and so 
E < A. Hence, EK < A < EK, showing that EK = A is algebraic over K. 

Finally, if {Ej is a family of fields, each algebraic over F, then so is 
V E i? since an element of V Ej is also an element of a composite of only 
a finite number of members of the family. I 

The algebraic closure of the rational numbers Q in the real numbers 
R is the field A of algebraic numbers. We saw in the previous chapter 
that there is an irreducible polynomial p n (x) E Z[x] of every positive 
degree n. Hence, A is an infinite algebraic extension of Q, showing that 
the converse of Theorem 2.6.1 does not hold. 

We note finally that if F < E is algebraic and if E = F(S) for some 
S C E then each element of E is a polynomial in finitely many elements 
from S. This follows from the fact that each a E F(S ) is a rational 
function in finitely many elements of S and so there exists a finite 
subset S 0 CS such that a E F(S 0 ). Hence, our discussion in Section 2.5 
related to finitely generated algebraic extensions applies here. 



2.7 Algebraic Closures 

Definition A field E is said to be algebraically closed if any nonconstant 
polynomial with coefficients in E splits in E. □ 
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Theorem 2.7.1 Let F be a field. Then there is an extension E of F that 
is algebraically closed. 

Proof. The following proof is due to Emil Artin. The first step is to 
construct an extension field F 1 of F, with the property that all 
nonconstant polynomials in F[x] have a root in F v To this end, for 
each nonconstant polynomial p(x) £ F[x], we let X p be an independent 
variable and consider the ring of all polynomials in the variables X p 
over the field F. Let 3 be the ideal generated by the polynomials p(X p ). 
We contend that 3 is not the entire ring For if it were, then there 
would exist polynomials q v . . . , q n £ % and p 1? . . . , p n £ 3 such that 

<*lPl( X Pl ) + "' + q n Pn( X p n ) = 1 

This is an algebraic expression over F in a finite number of independent 
variables. But there is an extension field E of F in which each of the 
polynomials p 1 (x),...,p n (x) has a root, say ofj, . . . , Qf n - Setting X p< = cq 
and setting any other variables appearing in the equation above equal 
to 0 gives 0 = 1. This contradiction implies that 5 

Since 3 ^ there exists a maximal ideal J such that 3 C } C 

Then F 1 = is a field in which each polynomial p(x) £ F[x] has a 

root, namely X -f J. (We may think of F 1 as an extension of F by 
identifying a £ F with a + J.) 

Using the same technique, we may define a tower of field extensions 

F < F 1 < F 2 < ••• 

such that each nonconstant polynomial p(x) £ Fj[x] has a root in F i+1 . 
The union E = (J Fj is an extension field of F. Moreover, any 
polynomial p(x) £ E[x] has all of its coefficients in F- for some i and so 
has a root in F i+1 , hence in E. It follows that every polynomial p(x) £ 
E[x] splits over E. Hence E is algebraically closed. I 

Definition Let F < E. Then E is an algebraic closure of F if F < E is 
algebraic and E is algebraically closed. We will denote an algebraic 
closure of a field F by F. 0 

Theorem 2.7.2 Let F < E. The following are equivalent. 

1) E is an algebraic closure of F. 

2) F < E is algebraic and any nonconstant polynomial p(x) over F 
splits in E. 

3) E is a maximal algebraic extension of F, that is, F < E is algebraic 
and if E < K is algebraic then K = E. 
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Proof. Clearly 1) implies 2). Suppose 2) holds and E < K is algebraic. 
Let a G K. Then F < E < E(a) is an algebraic tower and so a is 
algebraic over F. But the minimal polynomial m*n(a,F) splits in E and 
so a G E. Thus K = E and 3) holds. Finally, suppose 3) holds and let 
p(x) € E[x]. Any splitting field K for p(x) is algebraic over E and so 
must equal E, which implies that p(x) splits in E. Hence, 1) holds. I 

We can now easily establish the existence of algebraic closures. 

Theorem 2.7.3 Let F < A < E where A is the algebraic closure of F in 
E. If E is algebraically closed then A is also algebraically closed and 
hence is an algebraic closure of F. Thus, any field has an algebraic 
closure. 

Proof. We have already seen that A is an algebraic extension of F. By 
hypothesis, any p(x) E A[x] splits in E and so all of its roots lie in E. 
Since these roots are algebraic over A, they are also algebraic over F 
and thus lie in A. Hence p(x) splits in A and so A is algebraically 
closed. The final statement follows from Theorem 2.7.1. I 



2.8 Embeddings 

Homomorphisms between fields play a key role in the theory. Since a 
field F has no ideals other than {0} and F, it follows that any nonzero 
(ring) homomorphism <r:F— >L from F into L must be a monomorphism. 
If f:A— >B is any function and if C C A, we denote the restriction of f to 
C by f | c . 

Definition Let F and L be fields. A monomorphism <r:F— >L is called an 
embedding of F into L. We will denote the image of F under a by <tF 
or F°\ If F < E, an embedding r:E— »L for which r | F = a is called an 
extension of a to E. An embedding of E that extends the identity map 
*:F— >F is called an embedding over F, or an F-embedding. We will 
denote the set of all embeddings of E into L over F by ^Tom F (E,L). If 
p(x) = ^ajx 1 € F[x] and if <t:F— »L is an embedding we denote the 
polynomial J^o^a-Jx 1 by (<rp)(x) or p <T (x). 0 

Lemma 2.8.1 

1) Let <r:F— »L be an embedding of F into L and let p(x) G F[x]. Then 
a E F is a root of p(x) if and only if era is a root of p <T (x). 

2) If cr:K— >L is an embedding of K into L and if {Ej | i G 1} is a 
family of subfields of K then er( V Ej) = V <tF. 

3) If or: K— >L is an embedding of K into L and if F < K and 

a n G K then 
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(r(F(a 1 ,...,a n )) = F‘ T (<ra 1 ,...,o-a n ) 

Proof. Part 1) follows from the fact that <7(p(a)) = p iT (aa). For part 2), 
since a is injective, it preserves intersections. But 

V Ej = f| {H I Ej < H < K for all i G 1} 

and so 

„(vE,)=r> H | Ej < H < K for all i E 1} 

Since <r:K— >crK is an isomorphism, every IT satisfying <rEj < IF < crK is 
of the form crH for some H satisfying Ej < H < K and so 

a( V Ej) = f| {H' | crEj < H' < <rK for all i G 1} = V (tE; 

We leave proof of part 3) to the reader. I 

Even though the next result has a simple proof, the result is of major 
importance. 

Theorem 2.8.2 Let F < E be algebraic and let cr:E— »E be an embedding 
of E into itself over F. Then cr is an automorphism of E. 

Proof. Let a E E and let p(x) = min(a y F). Let S be the set of roots of 
p(x) that lie in E. Then a E S. If /? E S then a/3 is also a root of p(x) in 

E, and so <rj3 E S. Hence, a | s is a permutation on S and so there is a 
f3 E S for which a ft = a . This shows that cr is surjective, hence an 
automorphism of E. I 

It is a cornerstone of the theory that an embedding <r:F— »L into an 
algebraically closed field can be extended to any algebraic extension of 

F. We begin with the case of a simple algebraic extension. 

Suppose that cr:F— >L is an embedding of F into an algebraically 
closed field L. If F < E and a E E is algebraic over F then we may take 
advantage of the fact that a satisfies its minimal polynomial p(x) over 
F. to extend a to F(a) as follows. Since L is algebraically closed, 
p <r (x) E F ff [x] splits in L, and since a is an embedding, p <T (x) is 
irreducible over crF. Hence p <T (x) is the minimal polynomial over crF of 
any of its roots in L. Let /? be a root of p <T (x) in L. Then 

F(a) = {f(a) | f(x) E F[x], deg f(x) < deg p(x)} 

and since deg p^(x) = deg p(x), 

= {g (/?) I g(x) e F^x], deg g(x) < deg p(x)} 
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Thus we may define a map cr:F(a)—>F <r (/?) by 

<f(f(a))=r(/?) 

for any f(x) £ F[x]. It is straightforward to show that <r is an embedding 
of F(ct) into F a (/3) over a and that era = /?. This proves the first part of 
the following theorem. The rest of the theorem follows easily. 

Theorem 2.8.3 Let F < E and let a £ E be algebraic over F, with 
minimal polynomial p(x) = min(a,F). Let a:F->L be an embedding of 
F into an algebraically closed field L. 

1) If /? is a root of p° r (x) in L then a can be extended to an 
embedding <f:F(a)— >L for which era = /?. 

2) Any extension of <r to F(a) must map a to a root of p^(x) in L. 

3) The number of extensions of <r to F(a) is equal to the number of 
distinct roots of mm(c*,F) in F. I 

Zorn’s Lemma can now be used to extend the first part of this 
theorem to arbitrary algebraic extensions. 



Theorem 2.8.4 Let F < E be algebraic. Any embedding er: F— >L into an 
algebraically closed field L can be extended to an embedding <r:E— >L. 
Moreover, if a £ E, p(x) = min(a,F) and /? £ L is a root of p^x), then 
we can arrange it so that era = /?. (See Figure 2.8.1.) 

Proof. Let 8 be the set of all embeddings r:K— >L over a for which 
ra = (3 and F < K < E. Theorem 2.8.3 implies that 8 is not empty. 
Order the elements of 8 by saying that (r':K'— »L) > (r:K— >L) if K < K' 
and t' is an extension of r. Then 8 is a partially ordered set. If C = 
{r^Kj-^L} is a chain in 8 , the map rrUKj— »L defined by the condition 
r | K = r- is an upper bound for C in 8 . Zorn’s Lemma implies the 
existence of a maximal extension r:K— >L. We contend that K = E, for 
if not, there is an element 7 £ E — K. But 7 is algebraic over K and so 
we may extend r to K( 7 ), contradicting the maximality of r. I 



algebraic 




Figure 2.8.1 
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We can now establish the essential uniqueness of algebraic closures. 

Corollary 2.8.5 Any two algebraic closures of a field F are isomorphic. 

Proof. Let K and L be algebraic closures of F. The identity map i:¥—+F 
can be extended to an embedding r:K— ►L. Since K is algebraically 
closed so is rK. But L is an algebraic extension of rK and so L = rK. 
Hence, r is an isomorphism. I 

We close this section with a highly useful result on independence of 
embeddings. We choose a somewhat more general setting, however. A 
monoid is a nonempty set M with an associative binary operation and 
an identity element. If M and M' are monoids, a homomorphism of M 
into M' is a map ►M' such that = tp(a)rp(P) and rp(l) = 1. 

Definition Let M be a monoid and let K be a field. A homomorphism 
X:M— *K*, where K* is the multiplicative group of all nonzero elements 
of K is called a character of M in K. 0 

Note that an embedding cr:E— >L of fields defines a character 
<t:E*—L*. 

Theorem 2.8.6 (E. Artin) Any set *5 of distinct characters of M in K is 
linearly independent over K. 

Proof. Suppose to the contrary that 



a 1 Xi+-" + a n X„ = 0 

for Xi € ^ and <*[ € K, not all 0. Look among all such nontrivial linear 
combinations of the Xi’ s for one with the fewest number of nonzero 
coefficients and, by relabeling if necessary, assume that these coefficients 
are Thus, 

(2-8.1) <* 1 xi(g)+--- + a r x r (g) = 0 

for all g £ M and this is the “shortest” such nontrivial equation (hence 
c*i ^ 0 for all i). Note that since Xi(g) € K*, we have Xi(g) ^ 0 for all 
g € M. Hence, r > 1. 

Since X\ / X r > there is a g E M for which Xi(g) ^ X r (g)« For any 
h G M, we have 

«lXi(gh) + • • • + a r x r (gh) = 0 



that is, 



a lXi(g)Xi(h) + • • • + a r x r (g)x r (h) = 0 
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Multiplying (2.8.1) by Xi(g) gives 

<*i*i(g)Xi(h) + ■ • • + a r Xi(g)x r (h) = 0 
Subtracting the previous two equations gives 

« 2 [Xi(g) - x 2 (g)]x 2 ( h ) + • • • + a r [Xi(g) - x r (g)]x r (h) = o 

and since the last coefficient is not zero, this contradicts the minimal 
nature of (2.8.1). Hence the characters are linearly independent. I 

Corollary 2.8.7 (Dedekind Independence Theorem) Let E and L be 
fields. Any set of distinct embeddings of E into L is linearly 
independent over L. 0 



2.9 Splitting Fields and Normal Extensions 

Let us repeat a definition from Chapter 1. 

Definition Let ®F = {fj(x) | i G 1} be a family of polynomials in F[x]. A 
splitting field for over F is an extension field E of F with the property 
that each fj(x) splits in E and that E is generated by the set of all roots 
of every polynomial in ^F. 0 

It is clear that, given a particular algebraic closure F of F, there is a 
unique splitting field for *5 in F, since that splitting field must be the 
field generated by the roots in F of all polynomials in < J. It is also true 
that any two splitting fields for *F are isomorphic by an isomorphism 
that fixes the elements of the base field F. 

Theorem 2.9.1 Let p(x) G F[x], Any two splitting fields for p(x) over F 
are isomorphic over F. Specifically, if S 1 and S 2 are splitting fields for 
p(x) over F and if cr^— >S 2 is an F-embedding of Sj into an algebraic 
closure of S 2 then a is an isomorphism of S l onto S 2 . 

Proof. By Theorem 2.4.8, we may extend the inclusion map j: F— >S 2 to 
an embedding cr:S 1 — >S 2 over F. For any such embedding, let Rj be the 
set of distinct roots of p(x) in Sj. Then p*(x) = p(x) implies that 
( 7 R 1 C R 2 . But a is injective and each set Rj is finite, whence by 
symmetry, we have <jR 1 = R 2 . It follows that 

^ = (T^R,)] = F^R,) = F(R 2 ) = S 2 



and so <r is an isomorphism. I 




2 Field Extensions 



53 



This result also holds for arbitrary families of polynomials. 

Theorem 2.9.2 Let 5 be a family of polynomials over F. Any two 
splitting Fields for are isomorphic over F. Specifically, if S 1 and S 2 are 
splitting fields for over F and if (TiSj— >S 2 is an F-embedding of S 1 
into an algebraic closure of S 2 then a is an isomorphism of Sj onto S 2 . 

Proof. As in the proof of the previous theorem, we have an embedding 
<r:S 1 “»S 2 . Let E 1 < Sj and E 2 < S 2 be splitting fields for a polynomial 
p(x) in Theorem 2.9.1 implies that the restriction of a to E 1 is an 
isomorphism, whence crE 1 = E 2 . Taking the composite over the splitting 
fields E 1 in S 1 of all polynomials in gives 

< tS 1 = <t( V E x ) = V aE 1 = V E 2 = S 2 I 

Recall that if F < E is algebraic then E is an algebraic closure of F if 
and only if any nonconstant polynomial p(x) over F splits in E. Perhaps 
the next best thing would be that every irreducible polynomial p(x) 
over F that has one root in E splits in E. This property happens to 
characterize splitting fields. 

Theorem 2.9.3 Let F < E be algebraic and let F < E < F. The following 
are equivalent. 

1) E is a splitting field for a family of polynomials over F. 

2) Every embedding of E into F over F is an automorphism of E. 

3) Every irreducible polynomial over F that has one root in E splits 
in E. 

Proof. [1=>2] Let cr be an embedding of E into F over F. Since E is a 
splitting field for a family *5 of polynomials over F, we have E = F(R), 
where R is the set of roots of the members of Since a acts as a 
permutation on the roots of any member of we have <rR = R and so 

<tE = cr(F(R)) = F(crR) = F(R) = E 

[2=>3] Let f(x) be an irreducible polynomial over F, with a root ot in 
E. According to Theorem 2.8.4, if /? G F is a root of f(x), then the 
injection j.F— >F can be extended to an embedding <r.E— »F for which 
aa = /?. By hypothesis, a is an automorphism of E, whence /3 is also in 
E. Thus, f(x) splits in E. 

[3=>1] This follows immediately, since E is a splitting field for the 
family = {ram(c*, F) | a G E}. I 
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Definition An algebraic extension F < E that satisfies any (and hence 
all) of the conditions in the previous theorem is said to be a normal 
extension. We also say that E is normal over F. □ 

Corollary 2.9.4 If F < E is a finite normal extension then E is the 
splitting field of a finite family of irreducible polynomials. 

Proof. Let E = F(a 1 ,...,a n ). Since E is normal over F, each minimal 
polynomial min(a v F) splits in E. Clearly, E is generated by the roots of 
min{a v F) and so E is the splitting field of the finite family 
{min{a v F)}. I 

Note that any extension F < E, with E algebraically closed, is normal 
since any nonconstant p(x) G F[x] splits in E. 

As it happens, the class of normal extensions is not distinguished. 

Example 2.9.1 It is not hard to see that any extension of degree 2 is 
normal. The extension Q < Q( y/2) is not normal since Q( y/2) contains 
exactly two of the four roots of x 4 — 2, which is irreducible over Q. On 
the other hand, 

Q < Q(>/2) < Q(V2) 

with each step of degree 2 and therefore normal. As another example, 
since C is algebraically closed, Q < C is normal but Q < Q( \/2) is not 
normal. 0 

The previous example notwithstanding, many of the properties that 
define distinguished classes do hold for normal extensions. 

Theorem 2.9.5 

1) If F < E is normal and F < K < E then K < E is also normal. 

2) The class of normal extensions is closed under lifting: If F < E is 
normal and F < K is any extension then K < EK is normal. 

3) The class of normal extensions is closed under the taking of 
arbitrary composites and intersections: If {Ej is a family of fields, 
each normal over F, and each contained in a single larger field, 
then V Ej is normal over F and f] Ej is normal over F. 

Proof. Part 1) follows from the fact that a splitting field for a family of 
polynomials over F is also a splitting field for the same family of 
polynomials over K. For part 2), let E be a splitting field for a family *5 
of polynomials over F and let R be the set of roots in E of all 
polynomials in < T. Then E = F(R). Hence, EK = K(R), which shows 
that EK is a splitting field for the family thought of as a family of 
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polynomials over K. Hence, K < EK is normal. For part 3), let 
c t : V Ej — >F be an embedding over F. Then cr is an embedding when 
restricted to each Ej and so crEj = Ej, whence 

cr( V Ej) = V (tEj = V Ej 

and 

^(nE i ) = n^E i = nE i ■ 

Normal Closures 

Definition Let F < E be algebraic and let F be an algebraic closure of F 
containing E. The normal closure of F < E in F is the intersection of all 
fields L such that E < L < F and F < L normal. We denote this field by 
E nc . □ 



Note that since F < F is normal, the intersection described in the 
previous definition is a nontrivial one. 

Theorem 2.9.6 Let F < E < F be algebraic, with normal closure E nc . 

1) E nc is the smallest subfield of F with the property that E < E nc 

and F < E nc is normal. _ 

2) E nc = V <rE, over all cr E # 0 mp(E,F). 

3) E nc is the splitting field in F of the family {min(c*,F) | a E E}. 

4) E nc is the splitting field in F of the family {min( <*,F) | a E B} 
where B is a basis for E over F. 

5) If F < E is finite, then F < E nc is also finite. 

Proof. We prove only part 2), leaving the rest for the reader. Let 
E < L < F with F < L normal. Since E < L is algebraic, any a E 
#omp(E,F) may be extended to an embedding r:L— >F over F. Since 
F < L is normal, r is an automorphism of L. It follows that crE C L and 
so V <tE < E nc . On the other hand, if we let L = V crE, then F < L is 
normal since if r E #orap(L,F) then rcr runs over all elements of 
#omp(L,F) as a does and so 

rL = t( V crE) = V rcr(E) < V crE = L 

Since F < L is algebraic, we deduce that rL = L, that is, r is an 
automorphism of L over F. Hence, F < L is normal and so E nc < L = 
V <tE. This shows that V <tE = E nc . I 
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Exercises 

1. Let R be an integral domain containing a field F. Show that if 
[R:F] < oo then R must be a field. 

2. If F < E is algebraic and R is a ring such that F C R C E, show 
that R is a field. Is this true if F < E is not algebraic? 

3. Let F < E and F < K be finite extensions and assume that EK is 
defined. Show that [EK:F] < [E:F][K:F], with equality if [E:F] and . 
[K:F] are relatively prime. 

4. Let <t:K— >E be a homomorphism of fields and let F < K fl E. Show 
that cr is F-linear if and only if cr(a) = a for all a E F. 

5. Let F < E be a quadratic extension , that is, an extension of degree 
2. Show that E has a basis over F of the form {l,a} where a 2 E F. 

6. Prove that any extension of degree 2 is normal. 

7. Let F be an infinite field and let F < E be an algebraic extension. 
Show that | E | = | F | . 

8. Let F < F where F is an algebraic closure of F and let G = 
i4w/ F (F) be the group of all automorphism of F fixing F pointwise. 
Let 

F G = {a G F | <ra = a for all a E G} 

be the fixed field of F under G. Evidently F < F G < F. Show that 
the minimal polynomial of any a E F G over F has only one 
distinct root in F. Show also that the minimal polynomial of any 
a E F over F G has no multiple roots. Hint: for the latter 
statement, consider the polynomial p(x) = n ( x — a i) where aj are 
the distinct roots of m*n(a,F G ). 

9. Let p be a prime and let a ^ 1 be a complex p-th root of unity. 

Show that min(a,Q) = 1 + x + x 2 H bx 1 *” 1 . What is the 

splitting field for x p — 1 over Q? 

10. Suppose that F < E is a finite extension and that E = F(S) for 
some set SCE. Must there exist a finite subset S 0 CS for which 
E = F(S 0 )? 

11. Let F be a field of characteristic p ^ 0 and let a E F. Show that 
the following are equivalent: (i) a p E F (ii) F(a p ) = F (iii) 
[F(a)] pk C F where [F(c*)] p = {s p | s E F(a)}. 

12. Let F < E be a finite normal extension and let p(x) E F[x] be 
irreducible. Suppose that the polynomials f(x) and g(x) are monic 
irreducible factors of p(x) over E. Show that there exists a a E 
Autp(E) for which f^x) = g(x). 

13. Let F < E be algebraic. Show that a normal closure for F < E 
exists and that any two normal closures are isomorphic over F. 
Show also that if F < E is finite, so is F < E nc . If F < E is 
algebraic and cr E Hom F ( E,E) then Im a is contained in the normal 
closure of F < E that lies in E. 
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14. Let F be a field and let a v ... <> ct n be distinct elements of F. Prove 

that if ajCtj H b a n a n = ® f° r Integers k > 0 then aj = 0 for 

all i. 

15. Show that an extension F < E is algebraic if and only if any 
subalgebra S of E over F is actually a subfield of E. 

16. Let F < E be normal. Can any automorphism of F be extended to 
an automorphism of E? 

17. Suppose that F and E are fields and a:F— »E is an embedding. 
Construct an extension of F that is isomorphic to E. 



Constructions 

The goal of the following series of exercises is to prove that certain 
constructions are not possible using straight edge and compass alone. In 
particular, not all angles can be trisected, a circle cannot be “squared” 
and a cube cannot be “doubled.” The first step is to define the term 
constructible . We assume the existence of two distinct points P 1 and P 2 
and take the distance between these points to be one unit. 

Definition A point, line or circle in the plane is said to be constructible 
if and only if it can be obtained by a finite number of applications of 
the following rules. 

1) Pj and P 2 are constructible. 

2) The line through any two constructible points is constructible. 

3) The circle with center at one constructible point and passing 
through another constructible point is constructible. 

4) The points of intersection of any two constructible lines or circles 
are constructible. □ 

Cl. Show that if a line L and point P are constructible, then the line 
through P perpendicular to L is also constructible. 

C2. Show that if a line L and point P are constructible, then the line 
through P parallel to L is also constructible. 

C3. Taking the constructible line through P l and P 2 as the x-axis and 
the point Pj as the origin, the y-axis is also constructible. Show 
that any point (a,b) with integer coordinates is constructible. 

C4. Show that the perpendicular bisector of any line segment 
connecting two constructible points is constructible. 

C5. If P, Q and R are constructible points and L is a constructible line 
through R then a point S can be constructed on L such that the 
distance from S to R is the same as the distance from P to Q. 
(Thus, given distances can be marked off on constructible lines.) 
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Definition A real number r is constructible if its absolute value is the 
distance between two constructible points. 0 

C6. Show that any integer is constructible. 

C7. Prove that a point (a,b) is constructible if and only if its 
coordinates a and b are constructible real numbers. 

C8. Prove that the set of numbers that are constructible forms a 
subfield of the real numbers containing Q. Hint : to show that the 
product of two constructible numbers is constructible or that the 
inverse of a nonzero constructible number is constructible, use 
similar triangles. 

C9. Prove that if a > 0 is constructible, then so is yfa. Hint: first 
show that a circle of diameter 1 + a is constructible and that a 
line L through the center of the circle is constructible. Let P and 
Q be the intersection points of the circle with the line L. Mark off 
a units along the diameter PQ from P and denote that point by 
R. Is R constructible? Construct a line M through R perpendicular 
to L. Let S be one point of intersection of M and the circle. What 
is the length of the line segment RS? 

The two previous exercises prove the following theorem. 

Theorem Cl If the elements of a field F < R are constructible, and if 
a E F, then F(y/a) = {a + by/a | a, b E F} is constructible. 0 

Theorem C2 Let F be a subfield of R and let E > F be a quadratic 
extension. Then E = F(^/a) for some a E F. 

Proof. Exercise. I 

It follows from the two previous theorems that if F is constructible 
and if F < E is a quadratic extension then E is constructible. More 
generally, we have 

Theorem C3 If Q < E x < E 2 < • • • < E n is a tower of fields, each one a 
quadratic extension of the previous one then every element of E n is 
constructible. 0 

We now turn to a converse of Theorem C3. 

Theorem C4 Let four constructible points, whose coordinates lie in a 
field F < R, be given. Let L and M be lines or circles constructed from 
these points. Then the points of intersection of L and M have 
coordinates in a quadratic extension of F. 

Proof. Exercise. I 

The import of the previous theorem is that each time we construct a 
constructible number a, the number lies in a quadratic extension of the 
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field of previously constructed numbers. Thus, we have 

Theorem C5 A real number is constructible if and only if it lies in a 
field E n that is at the top of a tower of fields 



Q<E x <E 2 < ■■■<E n 

each one a quadratic extension of the previous one. Hence, if a is 
constructible, then [Q(c*):Q] must be a power of 2. 0 

Now consider what it means to say that an angle of 0° is 
constructible. Informally, we will take this to mean that we may 
construct a line L through the origin that makes an angle of 0° with the 
x-axis. 



CIO. Show that such a line L is constructible if and only if the real 
number cos 0° is constructible. (This is an informal demonstration, 
since we have not formally defined angles.) 

The previous exercise prompts us to make the following definition. 

Definition An angle of 0 ° is constructible if the real number cos 0 is 
constructible. D 

Cll. Show that a 60° angle is constructible. 

Cl 2. Show that a 20° angle is not constructible. Hint verify the formula 

cos 30 = 4 cos 3 0 — 3 cos 0 
Let r = cos 20° and show that r is a root of 
p(x) = 8x 3 — 6x — 1 

Show that p(x) is irreducible over Q and so [Q(r):Q] = 3. 

C13. Prove that every constructible real number is algebraic over Q. 
Assuming that 7r is transcendental over Q, show that any circle 
with a constructible radius cannot be “squared,” that is, a square 
cannot be constructed whose area is that of a unit circle. 

C14. Verify that it is impossible to “double” any cube whose side length 
r is constructible, that is, it is impossible to construct an edge of a 
cube whose volume is twice that of a cube with side length r. 




Chapter 3 

Algebraic Independence 



In this chapter, we discuss the structure of an arbitrary field 
extension F < E. Specifically, we will see that, for any extension F < E, 
there exists an intermediate field F < F(S) < E whose second step 
F(S) < E is algebraic and whose first step F < F(S) is purely 
transcendental The latter means that there is no nontrivial polynomial 
dependency (over F) among the elements of S, and so these elements 
act as “independent variables” over F. Thus, F(S) is the field of all 
rational functions in these variables. 



3.1 Dependence Relations 

We begin with a general notion of dependence, intended to model 
linear independence. 

Definition Let X be a nonempty set and let A C X x ^P(X) be a binary 
relation from X to the power set of X. We write x -< S (read: x is 
dependent on S) for (x,S) £ A and S -< T when s -< T for all s £ S. Then 
A is a dependence relation if it satisfies the following properties, for all 
S, T and U £ ?P(X), 

1) (reflexivity) 

S-^S 

2) (compactness) 

x -< S => x -< S 0 for some finite subset S 0 of S 
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3) (transitivity) 

S^T, T ^ U => S ^ U 

4) (Steinitz exchange axiom) 

x -< S, x -j< S — {s} => s -< (S — {s}) U {x} 

If x -|< S we say that x is independent of S. D 

Definition A subset S C X is dependent if s -< S — {s} for some s G S 
(equivalently, if S -< S — {s}). A subset SCX is independent if 
s -(< S — {s} for all s € S. (Hence the empty set is independent.) 0 

The reader should have no trouble supplying a proof for the following 
lemma. 

Lemma 3.1.1 

1) If S -< T then S -< T' for any superset T' of T. 

2) Any superset of a dependent set is dependent. 

3) Any subset of an independent set is independent. 

4) If S is a dependent set, then some finite subset S 0 of S is 
dependent. Equivalently, if every finite subset of T is independent, 
then T is independent. 0 

Theorem 3.1.2 If S is independent and x -|< S then S U {x} is 
independent. 

Proof. Let s G S. If s -< (S U {x}) — {s} then since s-|<S — {s}, the 
exchange axiom would imply that x S, a contradiction. Hence 
s -(< (S U {x}) — {s}. Furthermore, by hypothesis x -|< S = (S U {x}) — {x}. 
Thus, S U {x} is independent. I 

Definition A set B C X is called a base if B is independent and X B. □ 

Theorem 3.1.3 Let X be a nonempty set with a dependence relation -< . 

1) B C X is a base for X if and only if it is a maximal independent 
set in X. 

2) B C X is a base for X if and only if B is minimal with respect to 
the property X -< B. 

3) Let A C S C X, where A is an independent set (possibly empty) 
and X -< S. Then there is a base B for X such that A C B C S. 

Proof. For part 1), assume B is a base. Then B is independent. If B is 
not maximal independent, there exists an x G X — B for which B U {x} 
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is independent. Hence, x -(< (B U {x}) — {x} = B, a contradiction to 
X B. For the converse, if B is a maximal independent set and x -{< B 
then B U {x} is independent, which is not the case. Hence, X -< B and B 
is a base. 

For part 2), if B is a base, then X -< B. Suppose that some proper 
subset B 0 CB satisfies X -< B 0 . If b G B - B 0 then b -< B 0 -< B — {b}, 
contracting the independence of B. Hence B is minimal. Conversely, 
suppose that B is minimal with respect to the property X -< B. If B is 
dependent then X -< B -< B — {b} for some b E B, a contradiction to the 
minimality of B. Hence B is independent and a base for X. 

For part 3), we apply Zorn’s lemma. The set if of all independent 
sets B in X satisfying A C B C S is nonempty, since A E if. Order if by 
set inclusion. If C = {CJ is a chain in If, then the compactness property 
implies that the union (JCj is an independent set, which also lies in if. 
Hence, Zorn’s lemma implies the existence of a maximal element C E if, 
that is, C is independent, ACCCS and C is maximal with respect to 
these two properties. This maximality implies that S -< C and so 
X -< S -< C, which implies that C is a base. I 

To prove that any two bases for X have the same cardinality, we 
require a lemma. 

Lemma 3.1.4 Let S be a finite dependent set and let ACS be an 
independent subset of S. Then there exists a E S — A for which 
S -< S — {a}. 

Proof. Among all subsets of S — A, choose a maximal one B for which 
AUB is independent. Then B is a proper (perhaps empty) subset of 
S — A. If a E S — (A U B) then a-<AUB-<S — {a} and so S -< S — {a}.| 

Theorem 3.1.5 Any two bases for a set X have the same cardinality. 

Proof. Let B and C be bases for X. We first assume that at least one of 
B or C is finite; say B = {b 1 ,...,b m } is finite. Choose c x E C. The set 
Cj = {c 1 ,b 1 ,...,b m } satisfies the conditions of the previous lemma 
(with A = {cj}) and so, after renumbering the bj’s if necessary, we 
deduce that 



X^C^ { c i>b 1 ,...,b m _ 1 } 

For any c 2 EC — {cj}, the set C 2 = {c 1 ,c 2 ,b 1 ,. . .,b m-1 } satisfies the 
conditions of the lemma (with A = {c 1? c 2 }) and so, again after possible 
renumbering, we get 

X -< C 2 -< {c 1 ,c 2 ,b 1 , . . . , b m __ 2 } 
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Continuing this process, we must exhaust all of the elements of C before 
running out of elements of B, for if not, then a proper subset C' of C 
would have the property that X -< C', in contradiction to the 
independence of C. Hence, | C | < | B | . Since this shows that C is 
finite, we may repeat the argument with the roles of B and C reversed 
to get | B | = | C | . 

Let us now assume that B and C are both infinite sets, and let C = 
{cj | a G I}. Thus, | C | = 1 1 1 . For each b G B, we have b -< C and so 
there is a finite subset I b C I such that b -< {cj | i G I b }. This gives a 
map b— >I b from B to the set of finite subsets of the index set I. 
Moreover, 



for if j G I — U Ib th en > f° r any b G B, we have 



b -€ {Ci | i € I„> -€ C - {Cj> 



and so Cj -< B -< C — {cj}, which contradicts the independence of C. 
Hence, 



Cl = |I| = u lb <«o|B| = | B I 
'b 6 B 1 



Again reversing the roles of B and C shows that | B | = | C | . I 



3.2 Algebraic Dependence 

We recall a definition. 

Definition Let F < E. An element t G E is transcendental over F if t is 
not algebraic over F, that is, if there is no nonzero polynomial p(x) G 
F[x] such that p(t) = 0. Q 

Recall that if t is transcendental over F then F(t) is the field of all 
rational functions in the variable t, over the field F. 

Definition Let F < E and let S C E. An element a G E is algebraically 
dependent on S over F, written a -< S, if a is algebraic over F(S). If a is 
not algebraically dependent on S over F, that is, if a is transcendental 
over F(S) then a is said to be algebraically independent of S over F and 
we write a -|< S. 0 

The first order of business is to show that algebraic dependence is a 
dependence relation. 




3 Algebraic Independence 



65 



Theorem 3.2.1 Algebraic dependence is a dependence relation. 

Proof. Since any s E S is algebraic over F(S) we have S S. To show 
compactness, let a -< S and let C be the set of coefficients of 
mm(c*,F(S)). Since C C F(S), each c E C is a rational function over F in 
a finite number of elements of S and so there is a finite subset S 0 of S 
for which C C F(S 0 ). Hence a is algebraic over F(S 0 ), that is, a ~< S 0 . 

For transitivity, suppose that a -< S and S -< T and consider the 
tower 

F(T) < F(T U S) < F(T U S,a) 

Since every element of S is algebraic over F(T), and since a is algebraic 
over F(TUS) we deduce that a is algebraic over F(T), whence a -< T. 

Finally, we verify the exchange axiom. Suppose that a -< S and 
ol -|< S — {s}. Let p(x) = min(a, F(S)). Since F(S) = F(S — {s})(s), the 
coefficients of p(x) are polynomials in s over F(S — {s}), that is, 

p( x ) = S f i( s ) x ' 

i=0 

where we may assume that f d (x) ^ 0. Hence, the polynomial 

p( x >y) = 2 f i( y ) xi 

i=0 

in two independent variables is not the zero polynomial. This 
polynomial can also be written 

p( x >y) = igi( x )y’ 

i=0 

where g;(x) E F(S — {s})[x] and g e (x) ^ 0. Then 

e 

0 = p(a,s) = ^2 8i( a ) s ‘ 
i=0 

Since g e (x) E F(S — {s})[x] is nonzero and a is transcendental over 
F(S — {s}), we infer that g e (a) ^ 0 and e > 0. Hence, the equation 
above shows that s -< F(S — {s} U {a}). I 

We may now take advantage of the results derived for dependence 
relations. 

Definition Let F < E. 

1) A subset S C E is algebraically dependent over F if s -< S — {s} for 
some s E S, that is, if s is algebraic over F(S — {s}) for some s E S. 
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2) A subset S C E is algebraically independent over F if s -]< S — {s} 
for all s G S, that is, if s is transcendental over F(S — {s}) for all 
s G S. (Hence the empty set is algebraically independent over F.) 0 

Lemma 3.2.2 

1) Any superset of an algebraically dependent set is algebraically 
dependent. 

2) Any subset of an algebraically independent set is algebraically 
independent. □ 

Theorem 3.2.3 If S is algebraically independent over F and a is 
transcendental over F(S) then S U {a} is algebraically independent over 
F.D 

Let us provide another characterization of algebraically dependent 
sets. 



Theorem 3.2.4 Let F < E. A subset S of E is algebraically dependent 
over F if and only if there is some nonzero polynomial p(x 1 ,...,x n ) in 
n > 1 variables over F for which p(s 1 ,...,s n ) = 0, for distinct Sj G S. 



Proof. Suppose first that S is algebraically dependent over F. Then 
some s G S is algebraic over F(S — {s}) and so there exists a polynomial 
p(x) of degree d > 0 over F(S — {s}) for which p(s) = 0. Such a 
polynomial has the form 



pM = D 

i=0 



Pi( s l»---»s m ) jL i 



where p^Xj,. . .,x m ) and q^Xj,. . .,x m ) are polynomials in m variables 
and the Sj G S — {s} are distinct. Note that Pd( s iv> s m ) ^ 0 and 
q^Sj, . . . ,s m ) ^ 0 for all i. Letting x = s and clearing this of 
denominators gives 



d 

°= ]C r i( S l>”-- S m) si 

i=0 

for polynomials r^Xj, . . . , x m ), with T d (s v . . . , s m ) ^ 0. Thus 
T d( x v mm '* x nd not zero polynomial and p(x) = ^r i (x 1 ,...,x m )x 1 
is a nonzero polynomial satisfied by the m + 1 distinct elements 
sp.^s^s in S. 

For the converse, suppose that p(s 1 ,. . .,s n ) = 0 for distinct Sj G S, 
where p(x 1 ,...,x n ) is a nonzero polynomial over F. We may assume 
without loss of generality that s 2 ,...,s n do not enjoy a similar 
polynomial dependency and hence that 
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d 

p(x l5 • • • , x n ) = Pi( x 2 > • • • - XnKl 
i=0 

where p d (x 2 , . . . , x n ) ^ 0 and p d (s 2 , . . . , s n ) ^ 0. Hence the nonzero 
polynomial 

d 

P( x ) = ]CPi( S 2«---’ S n) xl 

i=0 

satisfies p(s 1 ) = 0, showing that Sj is algebraic over F(S — {sj}) and 
hence that S is algebraically dependent over F. I 

Corollary 3.2.5 Let F < E and let S = {s 1 ,...,s n } be a subset of E. Then 
S is algebraically independent over F if and only if s m is transcendental 
over F(s 1 ,...,s m _ 1 ) for all m = 1, . . . , n. 

Proof. If S is algebraically independent then s m is transcendental over 
F(S — {s m }) and therefore also over the smaller field F(s 1? . . . ,s m _ 1 ). For 
the converse, if S is algebraically dependent then there is a nonzero 
polynomial dependency of the form 

d 

°= X^Pi( s i<--->s m -iKi 

i=0 

for some m < n where p^Sj,. . ^ 0, whence s m is algebraic over 

F(s 1 ,...,s m _ 1 ). This contradiction implies that S is algebraically 
independent. I 



3.3 Transcendence Bases 

Definition Let F < E. A transcendence basis for E over F is a subset 
BCE that is algebraically independent over F and for which E -< B, 
that is, for which F(B) < E is algebraic. □ 

Since algebraic dependence is a dependence relation, we immediately 
get the following two results. 

Theorem 3.3.1 Let F < E. A subset B C E is a transcendence basis for E 
over F if and only if it satisfies either one of the following. 

1) B is a maximal algebraically independent subset of E over F. 

2) B is minimal with respect to the property that F(B) < E is 
algebraic. 0 
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Theorem 3.3.2 Let F < E. 

1) Any two transcendence bases for E over F have the same 
cardinality. This cardinality is called the transcendence degree of 
E over F, and is denoted by [E:F] t . 

2) Suppose FCACSCE where A is algebraically independent over 
F and F(S) < E is algebraic. Then there exists a transcendence 
basis B for E over F satisfying A C B C S. In particular, 
[E:F] t < | S | . □ 

Definition An extension F < E is said to be purely transcendental if 
E = F(B) for some transcendence basis B for E over F. 0 

We remark that if E is purely transcendental over F then E = F(B) 
for some transcendence basis B, but not all transcendence bases for E 
over F need generate E. The reader is asked to supply an example in 
the exercises. 

The following few simple results concerning transcendental extensions 
will prepare the way to finishing the proof (promised in Chapter 2) that 
the class of finitely generated extensions is distinguished. 

Corollary 3.3.3 If E is finitely generated over F and B is a 
transcendence basis for E over F then B is a finite set and F(B) < E is a 
finite extension. 

Proof. Theorem 3.3.2 implies that B is finite. The second part follows 
from the fact that E is finitely generated over F(B) as well, and a 
finitely generated algebraic extension is finite. I 

Theorem 3.3.4 Let F < K < E and suppose that F < K is algebraic. If 
TCE is algebraically independent over F, then T is also algebraically 
independent over K. In other words, T remains algebraically 
independent over any algebraic extension of the base field. 

Proof. If T is not algebraically independent over K, there exists t E T 
algebraic over K(T — {t}). Since F < K is algebraic, we deduce that 
F(T — {t}) < K(T — {t}) is algebraic, and so each step in the tower 

F(T - {t}) < K(T - {t}) < K(T - {t})(t) = K(T) 

is algebraic, whence t E K(T) is algebraic over F(T — {t}), in 
contradiction to the algebraic independence of T over F. I 

We are now in a position to finish the proof that the class of finitely 
generated extensions is distinguished. Note how much more involved 
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this task is than showing that finite or algebraic extensions are 
distinguished. 

Theorem 3.3.5 Let F < K < E. If E is finitely generated over F then K 
is also finitely generated over F. Thus, the set of finitely generated 
extensions is distinguished. 

Proof. Let S = {s 1 ,...,s k } be a transcendence basis for K over F. Then 
the second step in the tower F < F(S) < K < E is algebraic and E is 
finitely generated over F(S). Hence, if we can prove the theorem for 
algebraic intermediate fields, we will know that K is finitely generated 
over F(S) and therefore also over F, since S is a finite set. 

Thus, we may assume that F < K < E with F < K algebraic and 
show that [K:F] is finite. Let T = {t-^, . . . , t n } be a transcendence basis 
for E over F. Our plan is to show that 

[K:F] < [E:F(T)] 

(see Figure 3.3.1) by showing that any finite subset of K that is linearly 
independent over F is also linearly independent over F(T) [as a subset 
of E]. Since [E:F(T)] is finite by Corollary 3.3.3, the proof will be 
complete. 



E 




Figure 3.3.1 

First, we observe that, by Theorem 3.3.4, since T is algebraically 
independent over F, it is also algebraically independent over the 
algebraic extension K of F. 

Let Y = {y-|, . . .,y m } C K be linearly independent over F. Suppose 
that 

£r i (ti,...,t n )y i = 0 

where r^tj,. . .,t n ) £ F(T). By clearing denominators if necessary, we 
may assume that each rj(t 1? . . . ,t n ) is a polynomial over F. Collecting 
terms involving like powers of the tj’s gives 
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, S,(Sn v' y ') t ' 1 ' 

e l’ ' " ’ e n > 



< n = o 



where a jGF is the coefficient of tj 1 ---^ 11 in r^tj,. . . ,t n ). Since 

T is algebraically independent over K, the products tj 1 - • -t* n are linearly 
independent over K, and hence also over F(y 1 ,...,y ) C K. Thus 

ES e n ,i y i = ° 



and the linear independence of Y over F then implies that 



Hence r^tj,. . . ,t n ) = 0 for all i. This shows that Y is linearly 
independent over F(T), as desired. I 

The next theorem gives some verisimilar facts about simple 
transcendental extensions; in particular, if E = F(t), where t is 
transcendental over F, then any nonconstant rational function in t is 
also transcendental over F and E is algebraic over any intermediate 
field other than the base field F. 



Theorem 3.3.6 

1) Suppose that E = F(t), where t is transcendental over F. If s = 
f(t)/g(t) E F(t) where f(t) and g(t) are relatively prime and at 
least one is nonconstant, then s is transcendental over F, t is 
algebraic over F(s) and [F(t):F(s)] = max(deg f(t), deg g(t)). 

2) If t is transcendental over F then F(t) is algebraic over any field K 
satisfying F < K < F(t), K ^ F. 

3) If F < E is purely transcendental then any a E E — F is 
transcendental over F. 

Proof. For 1), if we show that t is algebraic over F(s), it will follow that 
s is transcendental over F, for otherwise F < F(s) < F(t) would be an 
algebraic tower. The polynomial 

p(x) = g(x)s-f(x) 6 F(s)[x] 

has the property that p(t) = g(t)s — f(t) = 0. Moreover, p(x) is 
irreducible over F(s). For if we think of p(x) as a polynomial in the two 
(independent) variables x and s, it is clear that if p(x) has a nontrivial 
factorization, one of the factors must be a nontrivial common factor of 
f(x) and g(x), which is impossible. Since 
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deg p(x) = max(deg f(x), deg g(x)) 

part 1) is proved. Part 2) follows easily from part 1). 

For part 3), if c*GE — F then a E F(t 1 ,...,t n ) for some finite set 
{tj,...,t n } of algebraically independent elements. By part 1), every 
element of F(t 1 ) not in F is transcendental over F. Similarly, every 
element of F(t 1 ,t 2 ) not in F(t x ) is transcendental over F(t x ) and hence 
also over F. Continuing this argument gives the desired result. I 

We leave it as an exercise to show that the converse of part 3) is 
false, that is, there exist extensions F < E that are not purely 
transcendental but for which every a E E — F is transcendental over F. 

The following is an example of an extension that is neither algebraic 
nor purely transcendental. 



Example 3.3.1 Let n > 3 and let F be a field with char(F)/fn. Let u be 
transcendental over F, let v be a root of p(x) = x n + u n — 1 in some 
splitting field and let E = F(u,v). Clearly, E is not algebraic over F. We 
contend that E is also not purely transcendental over F. Since v is 
algebraic over F(u), we deduce that {u} is a transcendence basis for E 
over F and so [E:F] t = 1. If E were purely transcendental over F there 
would exist a transcendental element t over F for which F(t) = F(u,v). 
Let us show that this is not possible. 

If F(t) = F(u,v) then 



a(t) c(t) 

u= b(t) Md v= a(t) 



where a(t), b(t), c(t) and d(t) are polynomials over F. Hence 

a n (t) c n (t) 
b n (t) + d n (t) 
or 

[a(t)d(t)] n 4- [b(t)c(t)] n = [b(t)d(t)] n 



This can be written 



Ht)+g n (t) = h n (t) 

for nonconstant polynomials f(t), g(t) and h(t), which we may assume 
to be pairwise relatively prime. Let us assume that deg f(t) < deg g(t), 
in which case deg h(t) < deg g(t). We now divide by h n (t) and take the 
derivative with respect to t to get (after some simplification) 

f n-1 [fh - fb'] + g n_1 [g'h - gh'] = 0 
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Since f and g are relatively prime, we deduce that g n 1 | f'h — fh\ But 
this implies 

(n — \)deg g < deg fh — 1 = deg f + deg h — 1 < 2 deg g — 1 

which is not possible for n > 3. Hence, F < F(u,v) is not purely 
transcendental. 0 

While the vector space dimension is multiplicative over a tower of 
fields, the transcendence degree is additive, as we see in the next 
theorem. 

Theorem 3.3.7 Let F < K < E. 

1) If S C K is algebraically independent over F and T C E is 
algebraically independent over K then SUT is algebraically 
independent over F. 

2) If S is a transcendence basis for K over F and T is a transcendence 
basis for E over K then S U T is a transcendence basis for E over 
F. 

3) [E:F] t = [E:K] t + [K:F] t 

Proof. For part 1), suppose for the purposes of contradiction that SUT 
is algebraically dependent over F. Then there exists an a £ S U T that is 
algebraic over F(S 0 UT 0 ) for some finite sets S 0 CS and T 0 C T not 
containing a, and we may assume that no proper subset T 1 of T 0 has 
the property that a is algebraic over F(S 0 UT 1 ). If a E T then since a is 
algebraic over F(S 0 UT 0 ), it is also algebraic over the larger field 
K(T — {c*}), in contradiction to the algebraic independence of T over K. 
Hence a £ T and so a £ S. But then T 0 cannot be empty, since S is 
algebraically independent over F. If t £ T 0 then the minimality of T 0 
implies that a is not algebraic over S 0 UT 0 — {t}, that is, 
a -|< SqUTq — {t}. But a?-<SoUTo and so the exchange axiom gives 
t -< S 0 U T 0 U {a} — {t}. In other words, t is algebraic over 
F(S 0 UT 0 U{a} - {t}), and hence also over the larger field K(T~ {t}), 
again contradicting the algebraic independence of T over K. This proves 
part 1). 

For part 2), we know by part 1) that SUT is algebraically 
independent over F. Also, since F(S) < K and K(T) < E are algebraic, 
each step in the tower F(S U T) < K(T) < E is algebraic and so 
F(SUT) < E is algebraic. Hence, SUT is a transcendence basis for F 
over E. Part 3) follows directly from part 2). I 
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*3.4 Simple Transcendental Extensions 

The class of purely transcendental extensions is much less well 
behaved than the class of algebraic extensions. For example, let t be 
transcendental over F. Then in the tower F < F(t 2 ) < F(t), the 
extension F < F(t) is purely transcendental (and simple) but the second 
step F(t 2 ) < F(t) is not transcendental at all. 

In addition, if F < E is purely transcendental and F < K < E, it does 
not necessarily follow that the first step F < K is purely transcendental. 
However, this is true for simple transcendental extensions. The proof of 
this simple statement illustrates some of the apparent complexities in 
dealing with transcendental extensions. 



Theorem 3.4.1 (Luroth’s Theorem) Let t be transcendental over F. If 
F < K < F(t) and K ^ F then K = F(s) for some s E F(t). 

Proof. The idea behind the proof is straightforward. Since K / F, we 
know by Theorem 3.3.6 that K < F(t) is algebraic. Indeed, for any s E 
K — F, the tower F(s) < K < F(t) is algebraic. We want to find an s E 
K — F for which [F(t):F(s)] = [F(t):K], showing that K = F(s). Recall 
from Theorem 3.3.6 that if s = f(t)/g(t) E K — F where f and g are 
relatively prime polynomials over F, then 



Let 



d s = [F(t):F(s)] = max(deg f(x),deg g(x)) 

K __\ • /a v n _i_ I . a n(^) 

x) - mm(t,K) - X + b ^x + ■•• + 



where a;(t), bj(t) E F(t). Then [F(t):K] = n and we wish to show that 
d s = n for some s E K — F. Evidently d s > n for all s E K - F. 

Note that since t is not algebraic over F, not all of the coefficients of 
p(x) can lie in F. Therefore, we may let 



fkW g 
b kW 



K-F 



for some k and assume that a k (t) and b k (t) are relatively prime. 
Consider the polynomial 



h(x) = a k (x) - 



a k(tQ 

M 4 ) 



M x ) 



Since s ^ F, we have h(x) ^ 0. But h(t) = 0 and so p(x) | h(x) over K. 
In other words, there exists q(x) E K[x] such that 

a k( x ) -^| b k( x ) = q( x )p( x ) 
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or 

a k( x ) b k( 4 ) -aicMbkM = b k (t)q(x)p(x) 

Multiplying both sides of this by 

r(t) = b 1 (t)---b n ( t ) 

gives 

(3.4.1) r(t)[a k (x)b k (t) - a k (t)b k (x)] = b k (t)q(x)r(t)p(x) 
where 

r(t)p(x) = b x (t)- • -b n (t)x n + ^2 ( b i(t)- • •b i _i(t.)a i (t)b i+1 (t)- • •b n (t)]x n ~ i 

i=l 

Now, we wish to factor out the greatest common divisor g(t) of the 
coefficients of x J (for j = 0,...,n) from the right side of this expression. 
Note that g(t) divides the gcd of any two of these coefficients, in 
particular, g(t) divides the gcd of 

b i(t)---b n (t) and b 1 (t)---b k _ 1 (t)a k (t)b k+1 (t)---b n (t) 

which is b 1 (t)-*-b k _ 1 (t)b k+1 (t)---b n (t), since a k (t) and b k (t) are 
relatively prime. Hence, once g(t) is factored out of r(t)p(x): 

r(t)p(x) = g(t)p'(t,x) 

where p'(t,x) E F[t,x] is primitive , in the sense that it is not divisible by 
any nonconstant polynomial in t, we still have as factors among the 
coefficients of p'(t,x) the polynomials b k (t) and a k (t). Thus, the degree 
of p'(t,x) with respect to t satisfies 

(3.4.2) ^-de^(p ; (t,x)) > max(deg a k (t ),deg b k (t)) = d s 
Thus, (3.4.1) becomes 

(3.4.3) r(t)[a k (x)b k (t) - a k (t)b k (x)] = b k (t)q(x)g(t)p'(t,x) 

Next we multiply both sides of (3.4.3) by a polynomial u(t) that will 
clear all of the denominators of q(x), giving 

u(t)r(t)[a k (x)b k (t) — a k (t)b k (x)] = b k (t)q'(t,x)p'(t,x) 

where p'(t,x), q'(t,x) E F[t,x]. Since p'(t,x) is not divisible by any 
nonconstant polynomial in t, we must have u(t)r(t) | b k (t)q'(t,x). Hence, 
there exists a polynomial q"(t,x) E F[t,x] for which 
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(3.4.4) a k (x)b k (t) - a k (t)b k (x) = q"(t,x)p'(t,x) 

Now, the t-degree of the left hand side of this equation is at most 
max(deg a k (t ),deg b k (t)) = d s 

and by (3.4.2) the t-degree of the right hand side is at least d g . Hence, 
the t-degree of either side of (3.4.4) is d s and (3.4.2) implies that 
<-de< 7 (q"(t,x)) = 0, that is 

(3.4.5) akMbkM - ■kOObkto = Q" Wp'O.*) 

where q"(x) E F[x]. Since the right side of (3.4.5) is not divisible by any 
nonconstant polynomial in t, neither is the left side. But the left side is 
symmetric in x and t, so it cannot be divisible by any nonconstant 
polynomial in x either. Hence, q"(x)p'(t,x) is not divisible by any 
nonconstant polynomial in x, implying that q"(x) E F, that is, 

(3.4.6) a k (x)b k (t) - a k (t)b k (x) = q"p'(t,x) 

where q" E F. Finally, since the x-degree and t-degree of the left side 
of (3.4.6) agree, this is also true of the right side. Hence by (3.4.2), 

n = x-deg( p'(t,x)) = t-deg( p'(t,x)) > d s > n 

Thus, d g = n, and the proof is complete. I 

It can be shown that Luroth’s theorem does not extend beyond 
simple transcendental extensions, but a further discussion of this topic 
would go beyond the intended scope of this book. 

We conclude with a determination of all F-automorphisms of a 
simple transcendental extension F(t). Let GL n ( F) denote the group of 
all nonsingular n x n matrices over F. The proof provides a nice 
application of Theorem 3.3.6. 

Theorem 3.4.2 Let F < F(t) be a simple transcendental extension and 
let ;4u/p(F(t)) denote the group of all automorphism of F(t) over F 

E GL 2 ( F) there is a unique <r A E i4w<p(F(t)) 



a b 
c d 



*A : 



at + b 
ct + d 



1) For each A = 
for which 
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Moreover, all automorphisms of F(t) over F have the form a A for 
some A E GL 2 (F). 

2) If A, BE GL 2 ( F), then 

‘’AB = and ^-1 = ‘’’A 1 

and cr A = cr B if and only if AB” 1 is a nonzero scalar matrix. In 
other words, the map r:GZ 2 (F)--*Att/p(F(t)) defined by r A = cr A 
is an epimorphism with kernel equal to the group of all nonzero 
scalar matrices in GL 2 (F). 

Proof. Clearly, the map <r A can be extended to a homomorphism of F(t) 
over F by setting 

A \g(t)/ g(ff A (t)) 

Since max(deg ( at + b), deg ( ct + d)) = l, Theorem 3.3.6 implies that 
[F(t):F(cr A t)] = 1 and so cr A (F(t)) = F((r A t) = F(t), showing that <7 A is 
surjective. Since a A is injective as well (fields have no nontrivial ideals), 
it is an automorphism of F(t) over F. 

We leave it to the reader to show that (7 a <t b = <x AB and that <r c = i 
if and only if C is a scalar multiple of the identity matrix. It follows 
that 

a A a A -l ~ a i~ 1 an( * * A — l^A “ a i = 1 

and so 




Also, cr A = <t b if and only if (7 AB -i ==: that is, if and only if AB” 1 is a 

scalar multiple of the identity. 

If a E Aut^{ F(t)) then F(t) = <r(F(t)) = F(crt) and so [F(t):F(<rt)] = 
1, which by Theorem 3.3.6 implies that at = <r A t for some 2x2 matrix 
over F. Hence, a = cr A . Since o'” 1 also has the form a B for some matrix 
B, we have 

1 - cr A a B - a AB 



which implies that AB = al, for some a E F, whence A is nonsingular. I 



Exercises 

1. Find an example of a purely transcendental extension F < E with 
two transcendence bases B and C such that E = F(B) but F(C) is 
a proper subfield of E. 

2. Let F < E and F < K. Show that [EK:K] t < [E:F] t . 
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3. Let F < E < K and let T € K - E. Show that [E(T):F(T)] t < [E:F] t 
with equality if T is algebraically independent over F or algebraic 
over F. 

4. Use the results of the previous exercise to show that if F < K < E 
and F < L < E then [KL:F] t < [K:F] t + [L:F] t . 

5. Let F be a field of characteristic ^ 2 and let u be transcendental 
over F. Suppose that u 2 + v 2 = l. Show that F(u,v) is a purely 
transcendental extension by showing that F(u,v) = F(w) where 
w = (1 + v)/u. 

6. Let F < K < E and suppose that SCE is algebraically 
independent over K. Prove that F(S) < K(S) is algebraic if and 
only if F < K is algebraic. 

7. Show that the converse of part 3) of Theorem 3.3.5 is false by 
describing an extension E of F that is not purely transcendental, 
but for which every a E E — F is transcendental over F. 

8. Prove that the transcendence degree of R over Q is | R | . 

9. Show that [C:Q] t = | C | . 

10. (An extension of Luroth’s Theorem) Suppose that F < E is purely 
transcendental. Show that any simple extension of F contained in 
E is transcendental over F. 

11. With regard to Theorem 3.4.2, show that 0^03 = <r A 3 and <Tq = 1 
if and only if C is a scalar multiple of the identity matrix I. 

12. Prove Lemma 3.1.1. 




Chapter 4 

Separability 



4.1 Separable Polynomials 

Let us recall a few facts about separable polynomials from Chapter 1. 

Definition An irreducible polynomial p(x) £ F[x] is separable if it has no 
multiple roots in any extension of F. An irreducible polynomial that is 
not separable is inseparable. □ 

Theorem 4.1.1 

1) An irreducible polynomial p(x) is separable if and only if 
p'(x) # 0. 

2) If F is a field of characteristic 0, or a finite field, then all 
irreducible polynomials over F are separable. 

3) Let char(F) = p ^ 0 and let p(x) be irreducible. 

a) If p(x) is inseparable, then there exists a positive integer d 
such that p(x) = q(x^), where q(x) is separable. In this case, 
all roots of p(x) have multiplicity p d . 

b) If p(x) = h(x pd ) where h(x) is any nonconstant polynomial and 
d is a positive integer, then p(x) is inseparable. 

4) Inseparable polynomials exist. □ 

The exponent d in part 3a) of the previous theorem is quite 
important and deserves a special name. Note that it can be 
characterized as the largest integer for which p(x) = q(x p ). 
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Definition Let p(x) £ F[x] be an irreducible polynomial. If char(F) = 
Pt^O, the integer d for which p(x) = q(x pd ), with q(x) separable, is 
called the radical exponent of p(x). If char(F) = 0, the radical exponent 
of p(x) is defined to be 0. If a is algebraic over F, the radical exponent 
of a over F is the radical exponent of min(a,F). □ 

The following definition allows us to handle the cases char(F) = 0 
and char(F) = p 0 simultaneously. 

Definition The exponent characteristic expchar(F) of a field F is defined 
to be 1 if char(F) = 0 and char(F) otherwise. 0 

Thus, any irreducible polynomial p(x) has the form p(x) = q(x pd ) 
where q(x) is separable, p is the exponent characteristic of F and d is 
the radical exponent of p(x). Moreover, p(x) is separable if and only if 
its radical exponent is 0. 

Definition Let F < E. Then a £ E is separable over F if a is algebraic 
over F and its minimal polynomial min(a^ F) is separable. The 
extension F < E is separable (or E is separable over F) if every element 
of E is separable over F. Q 

Before proceeding, we record a useful lemma. If F is a field and 
SCF then S n denotes the set {s 11 1 s £ S}. 

Lemma 4.1.2 Let F < E be algebraic with expchar(F) = p and let SCE. 

1) F(S pk ) = F([F(S)] pk ) for any k > 0. 

2) F(S) = F(S P ) holds for some k > 1 if and only if it holds for all 
k > 1. 

3) F = F pk holds for some k > 1 if and only if it holds for all k > 1. 
Proof. Part 1) follows from the fact that [F(S)] pk = F pk (S pk ) and so 

F([F(S)] pk ) = F(F pk (S pk )) = F(S pk ) 

To prove part 2), suppose that F(S) = F(S pk ) for some k > 1. Using part 
1), we have 

F(S) = F(S«*) = F([F(S)] pk ) < F([F(S)] pk ' 1 ) = F(S pk ' 1 ) 

from which we conclude that F(S) = F(S pl ) for all r < k. In particular 
F(S) = F(S P ) and so again using part 1), we obtain 
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F(S) = F(S pk ) = F([F(S)] pk ) = F([F(S p )] pk ) = F(S pk+1 ) 
and so F(S) = F(S pr ) for all r > k as well. For part 3), we observe that 

F pk < F p < F 

and so F = F pk holds for some k > 1 if and only if F = F p , which holds 
if and only if F = F pk for all k > 1. I 



4.2 Separable Degree 

If F < E is algebraic and if cr:F— >L is an embedding of F into an 
algebraically closed field L, we let S^EjF) denote the set of all 
extensions of <r to an embedding of E into L. Remarkably, the 
cardinality of 8 a (E,F) does not depend on a or L. 



Theorem 4.2.1 If F < E is algebraic and tfiF— »L is an embedding of F 
into an algebraically closed field L then the cardinality of g^EjF) 
depends only on the extension F < E and not on a or L. In other words, 
if r: F— *L' is an embedding with V algebraically closed, then 
| 8 <r (E,F) | = | 6 r (E,F) | . 

Proof. Observe first that if a is an extension of a to E then <fE is 
algebraic over <tF and therefore contained in the algebraic closure of a F 
in L. Hence we may as well assume that L is an algebraic closure of crF. 
Similarly, we may assume that L' is an algebraic closure of rF. 

Referring to Figure 4.2.1, the map r(j“ 1 :cr(F)— >r(F) is an 
isomorphism that can be extended, by Theorem 2.8.4, to an embedding 
A:L— >L'. Since aF < L is algebraic, so is rF < AL, and since AL is 
algebraically closed, we have AL = L', implying that A:L— »L' is an 
isomorphism. 

If or G ^(EjF) then the map A<f:E— is an embedding of E into V 
extending r on F. This defines a map from ^(EjF) to 8 r (E,F) given by 
<n-»A<j. It is clear that this map has an inverse given by rh->A~ 1 r and so 
both maps are bijections. I 



L <■ 

A 



A =t a ' 1 



L 

A 



X<j 

< 



A 

algebraic 



> 



F >a (F) 

a 



Figure 4.2.1 
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In view of Theorem 4.2.1, we may make the following definition. 

Definition Let F < E be an algebraic extension and let <r:F— »L be an 
embedding of F into an algebraically closed field L. The cardinality of 
the set ^(EjF) is called the separable degree of E over F and is denoted 
by [E:F] S . D 

It will be convenient for our present discussion to adopt the following 
definitions, at least temporarily. 

Definition An algebraic extension F < E is degreewise separable if 
[E:F] S = [E:F]. An algebraic extension F < E is separably generated if 
E = F(S) where each a £ S is separable over F. 0 

We next prove that the separable degree is multiplicative. 

Theorem 4.2.2 If F < K < E then [E:F] S = [E:K] S [K:F] S . 

Proof. The set 8^(K,F) of extensions of the inclusion map j: F—»E to an 
embedding j:K — >E has cardinality [K:F] S . Each such extension j £ 
8 ; {K,F) can be further extended to an embedding /: E-*E. Clearly, the 
resulting extensions are all distinct and so 

|S/E,F)| > | 8j(K,F) | |8j(E,K)| 

On the other hand, if a £ Sj(E,F) and <7 0 :K— >E is the restriction of a to 
K then <r Q is the extension of j: F— »E to K, hence an element of 8 ; (K,F). 
Since a is the extension of <r 0 to E, a is obtained by a double extension 
of j: F— *E and so equality holds in the inequality above. I 



4.3 The Simple Case 

Now let us consider simple extensions in the present context. Let 
F < F(a) be algebraic. If p(x) = ra 2 n(c*,F) and if j:F — >F is the inclusion 
map then Theorem 2.8.3 implies that [F(a):F] s = | 8 J (F(a),F) | is equal 
to the number of distinct roots of p(x). If p(x) is separable, it has 
deg p(x) = [F(a):F] distinct roots and so 

[F(a):F] s = [F(o):F] 

If p(x) = q(x pd ) has radical exponent d > 1, then each root of p(x) has 
multiplicity p d and so 

p d [F(a):F] s = deg p(x) = [F(a):F] 
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We thus have the following theorem. 

Theorem 4.3.1 Let F < F(a) be algebraic with expchar(F) = p. If 
mm(a, F) has radical exponent d, then 

(4.3.1) p d [F(a):F] s = [F(a):F] 

In particular, [F(a):F] s | [F(c*):F]. Moreover, the following are 
equivalent. 

1) a is separable over F. 

2) F < F(a) is degreewise separable; that is, [F(a):F] s = [F(a):F]. 

3) F < F(a) is separable. 

Proof. We have seen that (4.3.1) holds and since a is separable if and 
only if its radical exponent is 0, it follows that 1) and 2) are equivalent. 
Clearly 3) implies 1). To see that 2) implies 3), let /? E F(a) and 
consider the tower F < F (/?) < F(a). Then 

[F(a):F(/?)] s [F(/?):F] s = [F(a):F] 8 = [F(a):F] = [F(a):F(/?)][F(/?):F] 

Since F(a) = F(/?)(a), the extension F (/?) < F(c*) is simple and so each 
factor on the far left divides the corresponding factor on the far right, 
implying that the corresponding factors are equal. In particular, 
[F(/?):F] S = [F(/?):F], showing (by the equivalence of parts 1 and 2) that 
(3 is separable over F. Hence F < F(a) is separable. I 

Note that, according to the previous theorem, if a is separable so is 
any polynomial in a. The following is another characterization of 
separable elements. 

Theorem 4.3.2 Let a be algebraic over F, with expchar(F) = p. Then a 
is separable over F if and only if 

F(a) = F(a pk ) 

for some k > 1, and hence for all k > 1. 

Proof. Lemma 4.1.2 allows us to confine our attention to k = 1. Suppose 
a is separable over F. First suppose that a is separable over F. The 
polynomial (x — a) p = x p — a 9 E F(a p )[x] has a as a root and so there 
exists an r < p such that 



min(a,F(a p )) = (x — a) r 
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Since p(x) = mm(a,F) also has coefficients in F(a p ), we have 
(x — a) r | p(x). But p(x) is separable, and so r = 1. Thus r(x) = x — a, 
implying that a E F(a p ) and consequently F(a) = F(a p ). 

Conversely, suppose that F(a) = F(a p ) and let p(x) = min(a,F). If a 
is not separable over F then p(x) = q(x p ). Since q(a p ) = p(a) = 0, we 
get 

[F(a p ):F] < deg q(x) = J[P(a):F] 
which is contrary to F(a) = F(a p ). Thus a is separable over F. I 



4.4 The Finite Case 

Now we consider an arbitrary finite extension F < E. By Theorem 
2.5.2, we may let E = F(a 1 ,...,a n ) where <q is algebraic over F. Taking 
separable degrees in the tower 



(4.4.1) 

gives 



F < F(a x ) < F(a 1 ,a 2 ) < • • • < F(a 1 , . . . , a n ) 



(F(<*i, • • ■ , « n ) :F ls = II [ F ( a i> • • • - a i) :F ( a i> • • • . a i-i)] s 
i=l 



Since each step on the right is simple, Theorem 4.3.1 implies that each 
separable degree on the right divides the corresponding vector space 
degree, and so 

[F(a 1 ,...,a n ):F] s |[F(a 1) ...,a n ):F] 



Theorem 4.4.1 Let F < E be finite. Then [E:F] S | [E:F]. Also, the 
following are equivalent. 

1) E is separably generated. 

2) F < E is degreewise separable; that is, [E:F] S = [E:F]. 

3) F < E is separable. 

Proof. [1=>2] Suppose that E = F(S) where the elements of S are 
separable over F. The finiteness of F < E implies that E = F(a 1 ...a n ), 
for some n > 0, where cq E S. Since cq is separable over F(a 1 ,...,af|_ 1 ), 
each step in the tower (4.4.1) is generated by a single separable element. 
Hence, each step is degreewise separable and the multiplicativity of 
degrees implies that F < E is degreewise separable. [2=>3] Let /? E E and 
consider the tower F < F(/?) < E. Since F < E is degreewise separable, 
so is F < F(/3) and so (3 is separable over F. Hence, F < E is separable. 
[3=>1] This is clear from the definition. I 
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Thus, for finite extensions, the notions of separability, degreewise 
separability and separably generated are equivalent. Note that if 
F < K < E is finite then F < E is separable if and only if K < E and 
F < K are separable. We will show later that the class of separable 
extensions (finite or otherwise) is distinguished. Let us have another 
characterization of finite separable extensions. 

Theorem 4.4.2 If F < E is separable then E = F(E pk ) for all k > 1. By 
way of converse, if F < E is finite and E = F(E pk ) for some k > 1, then 
F < E is separable. 

Proof. Suppose F < E is separable. Lemma 4.1.2 allows to to confine 
our attention to k = 1. For any a G E, we have F(a) = F(a p ) C F(E P ) 
and so EC F(E P ). The reverse inclusion is obvious and so E = F(E P ). 

Now suppose that E = F(E P ). Since F < E is finite, we have E = 
F(S P ) for some finite subset SCE. Since E = F(S P ) < F(S) < E, we 
have E = F(S) = F(S P ) and so Lemma 4.1.2 implies that E = F(S pk ) for 
all k > 1. If d is the maximum of the radical exponents of the elements 

of S then every element of S pd is separable over F and so E = F(S pd ) is 

separably generated over F and therefore separable over F. I 

Corollary 4.4.3 Let F < E be a separable extension and let SCE. 

1) If S spans E over F, then S pk spans E over F, for any k > 1. 

2) If F < E is finite and S is linearly independent over F, then S p is 

linearly independent over F, for any k > 1. 

3) If F < E is finite and S is a basis for E over F, then S p is a basis 
for E over F, for any k > 1. 

Proof. If S spans E over F, then S p spans E p over F p , and hence also 
over F. Hence S p spans F(E P ) = E over F. Repeating this argument 
proves part 1). For part 2), since F < F(S) is separable and S spans 
F(S) over F, we conclude from part 1) that S p spans F(S) over F. Since 

I S P " I = IS I < oo 

it follows that S p is a basis for F(S) over F and is therefore linearly 
independent over F. Part 3) follows from parts 1) and 2). I 

We now prove that all finite separable extensions are actually simple 
extensions. 

Theorem 4.4.4 If F < E is a finite separable extension then there exists 
a 7 G E such that E = F(7). If F is an infinite field, there exist infinitely 
many such primitive elements 7. 

Proof. If F is a finite field, then so is E, and we appeal to the fact that 
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the multiplicative group E* of nonzero elements of E is cyclic. If E* = 
(7) then E = F(7) and E is simple over F. Let us now assume that F is 
an infinite field. 

Since F < E is finitely generated, it is sufficient to consider the case 
E = F(a,/?), and then appeal to an inductive argument. Let (Tj ,..., a 
be the distinct embeddings of E into F over F. Consider the polynomial 

p( x ) = n - <7 j)( a ) + fa - Oj)(0)*] e F[x] 
i #j 

Since none of the linear factors on the right is 0, we conclude that 
p(x) ^ 0. Since F is infinite, there must infinitely many elements s E F 
such that p(s) ^ 0. Hence (cq - <7j)(c*) + s(<7 i - <7j)(/?) 0 for all i^j, 

that is, the n elements 7. = <rp 4- s a-fl = -f s/3) are distinct. But 
each 7j is a root of min(a + s/?,F) and so 

[E:F] > [F(a + s/?):F] > n = [E:F] S = [E:F] 

from which it follows that E = F(a + s/3). I 

Corollary 4.4.5 If F has characteristic 0 or if F is a finite field then any 
finite extension of F is simple. □ 

We can improve upon Theorem 4.4.4 without too much additional 
work. This result will prove useful to us later. 

Theorem 4.4.6 If E = F(a 1 ,...,a n ,/?) where aj is separable over F and /? 
is algebraic over F then F < E is a simple extension. 

Proof. If F is finite, then E is finite, and therefore F < E is simple. Let 
us assume that F is infinite. Theorem 4.4.4 implies that E = F(c*,/?) for 
some a separable over F. We may proceed as in the proof of that 
theorem to obtain an element /3 + sa for which the elements <7 j(/? + sa) 
are distinct, where cr 1 ,...,(T n are the distinct embeddings of E into F 
over F and a l = t. Note that the set {o^a, . . . ,<r n a} contains a complete 
set of roots of p a (x) = mm(a,F) and {ctj/?,...,^/?} contains a complete 
set of roots of p^(x) = mm(/?,F). 

Let q(x) = p p(/3 + sa — sx). Since ap = a, we have 

qfa«) = pp(P) = 0 

and since <r 1 /? + sa-^a — sap / a-fl for i / 1, we have 
q((T i a) = p /3 (or 1 /? + s<r 1 a - sap) ^ 0 

for i ^ 1. Hence, the polynomials p a (x) and q(x), both of which have 
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coefficients in F(/? -f era), have precisely one root in common, namely 
<T x ot = a. Thus, since p a (x) has no multiple roots, the greatest common 
divisor of p a (x) and q(x) is x — a, which must have its coefficients in 
F (f3 + sa) as well. In other words a E F(/? -f sa), from which it follows 
that /? E F (/? + sa), whence F (a,/?) = F(/? + sa). I 



4.5 The Algebraic Case 

For arbitrary algebraic extensions F < E, we have the following. 

Theorem 4.5.1 Let expchar(F) = p. An algebraic extension F < E is 
separable if and only if it is separably generated. If F < E is separable 
then E = F(E pk ) for all k > 1. 

Proof. If F < E is separable then E is separably generated (by itself) 
over F. For the converse, assume that E = F(S) where each a E S is 
separable over F and let f3 E E. Then /? E F(S 0 ) for some finite subset 
S 0 C S. Since F < F(S 0 ) is finitely generated and algebraic, it is finite. 
Thus, Theorem 4.4.1 implies that F < F(S 0 ) is separable. Hence /? is 
separable over F and so F < E is separable. The last statement was 
proved in Theorem 4.4.2. I 

We, may now establish that the class of separable extensions is 
distinguished. 

Theorem 4.5.2 The class of separable extensions is distinguished. It is 
also closed under the taking of arbitrary composites. If F < E is 
separable and E nc is the normal closure of E over F then F < E nc is 
separable. 

Proof. Let F < K < E. If all extensions are finite, we have already 
shown (by a degree argument) that F < E is separable if and only if 
F < K and K < E are separable. In general, we leave it as an exercise to 
show that if F < E is separable then F < K and K < E are separable. 
Suppose that F < K and K < E are separable and let a E E. Let CCK 
be the set of coefficients of p(x) = rm'n(a,K). Then p(x) = mm(a, F(C)) 
and so a is separable over F(C). It follows that each step in the tower 
F < F(C) < F(C,a) is finite and separable, implying that a is separable 
over F. This shows that F < E is separable and completes verification of 
property Dl) in the definition of distinguished class. 

For property D2), let F < E be separable and let F < K. Since every 
element of E is separable over F it is also separable over the larger field 
K. Hence EK = K(E) is separably generated and is therefore separable. 
The fact that separable extensions are closed under the taking of 




88 



4 Separability 



arbitrary composites follows from the finitary property of arbitrary 
composites. That is, each element of an arbitrary composite involves 
elements from only a finite number of the fields in the composite and so 
is an element of a finite composite, which is separable. 

Finally, the normal closure E nc is the composite V (<tE) for cr E 
Hom F ( E,E). Since F < E is separable and <r:E— htE is an isomorphism 
over F, the elements a E E and a a E crE have the same minimal 
polynomial and so the separability of a over F implies that of aa, 
whence F < crE is separable. Since separability is preserved under 
composites, F < E nc is separable. I 



4.6 Pure Inseparability 

The antithesis of a separable element is a purely inseparable element. 

Definition An element a algebraic over F is purely inseparable over F if 
its minimal polynomial min(a,F) has the form (x — a) n for some n > 1. 
An algebraic extension F < E is purely inseparable if every element of E 
is purely inseparable over F. □ 

Note that any a E F is purely inseparable over F. In fact, an element 
a is both separable and purely inseparable over F if and only if a E F. 
It follows that, for extensions of fields of characteristic 0 or finite fields, 
there are no “interesting” purely inseparable elements. 

Example 4.6.1 Let char(F) = 2. If t is transcendental over F, then t is 
purely inseparable over F(t 2 ), since its minimal polynomial over F(t 2 ) is 
x 2 - 1 2 = (x - 1) 2 . □ 

Example 4.6.2 Here we present an example of an element that is neither 
separable nor purely inseparable over a field F. Let char(F) = p and let 
a E F be nonzero. Let t be transcendental over F and let 

s t p + a 

According to Theorem 3.3.6, F(s) < F(t) is algebraic and has degree 
equal to p 2 . Since t is a root of the monic polynomial 

2 

p(x) = x p — sx p — sa 

2 

of degree x p over F(s), this must be the minimal polynomial for t over 
F(s). Since p(x) = q(x p ), we deduce that t is not separable over F(s). 
On the other hand, if t were purely inseparable over F(s), there would 
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exist /? E F(s) for which 

x p — sx p — sa = (x — /?) p = x p — (3 P 

which would imply that s = 0, which is not the case. Hence, t is neither 
separable nor purely inseparable over F(s). 0 

Definition Let F < E be finite. Since [E:F] S | [E:F], we may write 



[E:F] = [E:F].[E:F]i 



where [E:F]j is the inseparable degree or degree of inseparability of E 
over F. 0 

Note that, while the separable degree is defined for infinite 
extensions, the inseparable degree is defined only for finite extensions. 

Definition If F < E is algebraic and [E:F] S = 1, we say that F < E is 
degreewise purely inseparable. When F < E is finite, this is equivalent 
to [E:F]j = [E:F]. □ 

Theorem 4.6.1 Let F < E be a finite extension with expchar(F) = p. 

1) If F < K < E then [E:F] S = [EiK^K:^. 

2) F < E is separable if and only if [E:F]j = 1. 

3) If a E E then [F(a):F]j = p d where d is the radical exponent of a . 

4) [E:F]j is a power of p. 

Proof. The first three statements are clear. The last statement follows 
from the fact that F < E is finitely generated and the inseparable degree 
is multiplicative. We leave the details to the reader. I 

We next characterize purely inseparable elements. 

Theorem 4.6.2 Let a be algebraic over F, with radical exponent d and 
let p(x) = min(a,F). The following are equivalent. 

1) a is purely inseparable over F. 

2) The polynomial (x — a) n has coefficients in F, for some n > 1. 

3) p(x) = (x - a) pd = x pd — a pd . 

4) a is a root of x pk — /?, for some /? E F and k > 0. 

5) c* pk E F for some k > 0. 

6) d is the smallest nonnegative integer for which o pd E F. 
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Proof. We establish only those implications that are not immediate. 
Recall that p(x) = q(x pd ) where q(x) is separable over F. 

[2=>3] If 2) holds then q(x pd ) | (x - a) n and so q(x pd ) = (x — a) r for 
some 1 < r < n. It follows that r = mp d , where m = deg q(x). Hence, 

q(x pd ) = (x - a) mpd = (x p “ - </) m 

Thus q(x) = (x — c* pd ) m and the separability of q(x) implies that m = 1, 
whence 

r = p d and p(x) = (x — a) p 
[5=>6] If 5) holds then 

r(x) = x pk — a pk = (x — a) pk 

is a polynomial over F with r(a) = 0. Hence q(x pd ) | (x — ct) pk , showing 
that k > d. Since r(x) £ F[x], the fact that 2) implies 3) shows that 

p(x) = x pd — a pd 

and so a pd £ F. Hence d is the smallest integer for which a pd £ F. 

[6=>1] If 6) holds, then 

r(x) = (x — a) pd = x pd — a pd 

is a polynomial over F with a as a root, and so p(x) | r(x). Hence, p(x) 
has the form (x — a) n for some n > 1 and a is purely inseparable over 
F. I 

Theorem 4.6.3 Let F < E be algebraic. The following are equivalent. 

1) E is purely inseparably generated; that is, generated by purely 
inseparable elements. 

2) F < E is degreewise purely inseparable; that is, [E:F] S = 1. 

3) F < E is a purely inseparable extension. 

Proof. [1=>2] Suppose first that E = F(I) where all elements of I are 
purely inseparable over F. Any embedding a:E — »L over F is uniquely 
determined by its values on the elements of I. But if a £ I then aa is a 
root of the minimal polynomial mzn(a,F) and so aa = a. Hence a must 
be the identity and [E:F] S = 1. 

[2=>3] Let a £ E. Then [F(a):F] s = 1 and since F < F(a) is a finite 
extension, Theorem 4.3.1 implies that 

p d = [F(a):F] = deg min(a,F) 
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Since mzn(a, F) = q(x p ), it follows that q(x) is linear and so 

rmn(a,F) = x pd — /? 

for some /3 E F, which implies by Theorem 4.6.2 that a is purely 
inseparable over F. [3=>1] This is clear. I 

We can now show that the class of purely inseparable extensions is 
distinguished. 

Theorem 4.6.4 The class of purely inseparable extensions is 
distinguished. It is also closed under the taking of arbitrary composites. 

Proof. Let F < K < E. Since pure inseparability is equivalent to 
degreewise pure inseparability and [E:F] S = 1 if and only if [E:K] S = 1 
and [K:F] S = 1, it is clear that Dl) holds. For D2), suppose that F < E 
is purely inseparable and F < K. Since every element of E is purely 
inseparable over F, it is also purely inseparable over the larger field K. 
Hence EK = K(E) is purely inseparably generated and therefore purely 
inseparable. We leave proof of the last statement to the reader. I 



4.7 Separable and Purely Inseparable Closures 

Let F < E. According to Theorem 4.4.1, if a, /? E E are separable 
over F then F(c*,/?) is separable over F. It follows that a ± (3, ct(3, and 
a” 1 (for a ^ 0) are separable over F. Hence, the set of all elements of E 
that are separable over F is a subfield of E. A similar statement holds 
for purely inseparable elements. 

Definition Let F < E. The field 

F sc = {a G E | a separable over F} 

is called the separable closure of F in E. The field 

F 1C = {a £ E | a is purely inseparably over F} 

is called the purely inseparable closure of F in E. □ 

The separable closure allows us to decompose an arbitrary algebraic 
extension into separable and purely inseparable parts. 
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Theorem 4.7.1 Let F < E be algebraic. 

1) In the tower F < F sc < E the first step is separable and the second 
step is purely inseparable. 

2) E lE:Fli C F sc . 

3) Any embedding <r:E— »E is uniquely determined by its restriction 
to F sc . 

Proof. For part 1), if a G E has radical exponent d, then ram(a, F) = 
q(x pd ) where q(x) = mzn(a pd ,F) is separable and so a pd G F sc . Thus, 
Theorem 4.6.2 implies that a is purely inseparable over F sc . This shows 
that F sc < E is pusrely inseparable. For part 2), since p d = 
[F(a):F]| | [E:F] i we see that 



for all a G E and so E 1 C F sc . We leave proof of the last statement 
to the reader. I 

Corollary 4 . 7.2 Let F < E be finite. Then [E:F] S = [F SC :F] and [E:F]j = 
[E:F SC ]. 0 

Part 1 of Theorem 4.7.1 shows that any algebraic extension can be 
decomposed into a separable extension followed by a purely inseparable 
extension. In general, the reverse is not possible. Although F < F 1C is 
purely inseparable, the elements of E — F IC need not be separable over 
F; they are simply not purely inseparable over F. However, it is not 
hard to see when F lc < E is separable. 

Theorem 4 . 7.3 Let F < E be algebraic. Then F ic < E is separable if and 
only if E = F sc F ic . 

Proof. If F 1C < E is separable then so is F SC F 1C < E. But since F sc < E is 
purely inseparable, so is F SC F 1C < E. Thus, we have E = F SC F 1C . 
Conversely, if E = F sc F ic then F ic < F sc F ic , being a lifting of a 
separable extension F < F sc , is also separable. I 

We can do better than the previous theorem when F < E is a normal 
extension, which includes the case E = F. Let G = Aut F ( E) be the set of 
all automorphisms of E over F. Since F < E is normal, G is also the set 
of all embeddings of E into F over F. We define the fixed field of G in 
E by 

F( G) = {a G E | aa = a for all a G G} 




4 Separability 



93 



Theorem 4.7.4 Let F < E be a normal extension. Let G = Autp(E) and 
let F(G) be the fixed field of G in E. Then F( G) = F lc . Furthermore, in 
the tower F < F 1C < E, the first step is purely inseparable and the 
second step is separable. 

Proof. Let a £ F(G). If (3 £ F is a root of p(x) = min(c*,F) then there 
exists an embedding a: E-+F over F for which era = /?. But era = a and 
so P = a. Hence mm(a,F) has only one root and so a £ F 1C . On the 
other hand, if a £ F 1C then any <j £ G must map a to itself, since it 
must map a to a root of min(a,F). Hence a £ F(G). This proves that 
F(G) = F ic . 

Now let a £ E and p(x) = mm(a,F(G)). Let q(x) = n( x — r i) where 
R={ri,...,r n } is the set of distinct roots of p(x) in E. Since any a £ G 
is a permutation of R, we deduce that q <T (x) = q(x) and so the 
coefficients of q(x) lie in F( G). Hence q(x) = p(x) and a is separable 
over F( G). I 

Corollary 4.7.5 If F < E is normal then F 1C < E is separable and E = 
F sc F ic . 0 

Let us conclude this section with a characterization of simple 
algebraic extensions. If E = F(a) is a simple algebraic extension of F 
and if d is the radical exponent of a, we have seen that p d = [E:F]j is 
the smallest nonnegative power of p such that a pd is separable over F, or 
equivalently, such that E p C F sc . It turns out that this property 
actually characterizes simple algebraic extensions. Before proving this, 
we give an example where this property fails to hold. 

Example 4.7.1 Let u and v be transcendental over K with char(K) = 
p ^ 0. Let E = K(u,v) and F = K(u p ,v p ). It is easily seen that F < E is 
purely inseparable with [E:F]j = p 2 . However, a £ E implies a p £ F and 
so E p C F. 0 

We next require the following useful lemma. 

Lemma 4.7.6 If char(F) = p ^ 0 and a £ F, a £ F p then f(x) = 
irreducible for every k > 1. 

Proof. Let /) £ F be a root of f(x) = x p * — a. Then 

f(x) = (x-/?) pk 

If p(x) = rmn(/?,F) then p(x) | f(x) and so p(x) = (x — /?) pd 
d < k. But if d < k then 



:X pk - 



a is 



for some 
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and so 

a = /? pk = (/3 pk "V G F p 

contrary to assumption. Hence d = k and f(x) is irreducible. I 

Theorem 4.7.7 Let F < E be a finite extension with [E:F]| = p d . Then 
F < E is simple if and only if d is the smallest nonnegative integer for 
which E P* C F sc . 

Proof. We have seen that if F < E is simple then d is the smallest such 
nonnegative integer. For the converse, note first that if F is a finite field 
then so is E, implying that E* is cyclic and so F < E is simple. Let us 
assume that F is an infinite field and look at the second step in the 
tower F < F sc < E. This step is purely inseparable. Since F sc < E is 
finite, we have 

E = F sc (/3 1 ,...,/3 n ) 

If for some k < d, we have £ F sc for all i, then E pk C F sc , contrary to 
hypothesis. Hence one of the /?j’ s, say /?, satisfies 

/? pd £ F sc , /? pk £ F sc for k<d 

It follows that 

[F sc (/?):F sc ]i = p d = [E:F]j > [E-.F 8 ^ 

Since F sc (/3) < E, we have [F sc (^):F sc ] i = [EiF 80 ^ and since the 
extensions involved are purely inseparable, we get [F SC (/?):F SC ] = [E:F SC ], 
Hence, E = F sc (/?). 

Our tower now has the form F < F sc < F sc (/?) where /? is purely 
inseparable over F sc . In addition, F < F sc is finite and separable and 
therefore simple. Thus there exists a £ F sc such that F sc = F(a) and the 
tower takes the form F < F(a) < F(c*,/?) where a is separable over F 
and P is purely inseparable over F(a). By Theorem 4.4.6, the extension 
F < F(a,f3) is simple. I 

Note that Theorem 4.7.7 implies that the extension F < E of 
Example 4.7.1 is not simple. 



4.8 Perfect Fields 

Definition A field F is perfect if every irreducible polynomial over F is 
separable. 0 
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It is clear from the definitions that if F is perfect then any algebraic 
extension of F is separable. Conversely, suppose that every algebraic 
extension of F is separable. If p(x) E F[x] is irreducible and a is a root 
of p(x) in some extension of F then F < F(a) is algebraic and so a is 
separable over F, that is, p(x) is separable. Thus, F is perfect. 

Theorem 4.8.1 A field F is perfect if and only if every algebraic 
extension of F is separable over F. □ 

Theorem 4.8.2 Every field of characteristic 0 and every finite field is 
perfect. □ 

Note that if expchar(F) = p then F p = {a 9 | a E F} is a subfield of F. 
The map <£:F — >F defined by a p a = c* p is called a Frobenius map. It is a 
monomorphism since a p ± /? p = (a ± /?) p . 

Theorem 4.8.3 Let F be a field with expchar(F) = p. The following are 
equivalent. 

1) F is perfect. 

2) F = F pk for some (and hence all) k > 1. 

3) The Frobenius map <r k is an automorphism, for some (and hence 
all) k > 1. 

Proof. [1=>2] Suppose F is perfect. Let a E F and consider the 
polynomial p(x) =x p -aE F[x]. If f3 is a root of p(x) in a splitting 
field then /? p = a and so 

p(x) = X p - /3 P = (x - /?) p 



Hence /? is purely inseparable over F. But /? is also separable over F and 
therefore /3 E F. Hence, a E F p for all a E F, that is, F C F p . Since the 
reverse inclusion is manifest, we have F = F p . Lemma 4.1.2 implies the 
desired result. 

[2=>1] We may assume that p > 1. If 2) holds then Lemma 4.1.2 
implies that F p = F. It follows that if p(x) E F[x] is irreducible but not 
separable, then 

p« = = E b f<«') p = (E b /) p 

contradicting the fact that p(x) is irreducible. Hence, p(x) is separable 
and so F is perfect. Since the Frobenius map is a monomorphism, 
statements 2) and 3) are easily seen to be equivalent. I 

While it is true that any algebraic extension of a perfect field is 
perfect, not all subfields of a perfect field need be perfect. 
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Theorem 4.8.4 

1) If F < E is algebraic and F is perfect then E is perfect. 

2) If F < E is finite and E is perfect then F is perfect. 

Proof. Part 1) follows from Theorem 4.8.1 and the fact that every 
algebraic extension of E is an algebraic extension of F. For part 2), let 
expchar(F) = p and suppose first that F < E is simple. Thus, E = F(a) 
is perfect and a is algebraic over F, with minimal polynomial p(x) = 
J^apd. Then 

0 = [Ey] p = £a p a pi 

Hence, the degree of a p over F p is no greater than the degree of a over 
F, in symbols, [F p (a p ):F p ] < [F(a):F]. But F p (a p ) = [F(a)] p = F(a) 
since F(a) is perfect and so [F(a):F p ] < [F(a):F]. Since F p < F, equality 
holds and F p = F, whence F is perfect. Since F < E is finitely generated 
by algebraic elements, the result follows by repetition of the previous 
argument. I 

Note that we cannot drop the finiteness condition in part 2) of the 
previous theorem since, for example, F < F is algebraic and F is perfect 
even if F is not. 

Perfect Closures 

Let char(F) = p 0 and let F be an algebraic closure of F. For each 
k > 1, the set 

F 1/pk = {a € F | c/ 6 F} 
is a subfield of F. Moreover, we have 

FCF 1/p CF 1/p2 C--- 



The union 

o° 

pcl( F) = (J F 1/p 

k=l 

which is also a subfield of F, is known as the perfect closure of F in F, 
which name is justified by the following theorem. 

Theorem 4.8.5 Let F be a field of characteristic p ^ 0. Then pcl( F) is 
the smallest perfect subfield of F containing F. 

Proof. To see that pcl( F) is perfect, observe that if a € pcl( F) then 
a p G F for some k > 1. Hence, letting 0 be a root of x p — a in F, we 
have a = /? p , where 
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/? p 



k+l 



= a 



pk G F 



and so /? G pc/(F). This shows that pc/(F) C [pc/(F)] p . Since the reverse 
inclusion is obvious, it follows that pcl( F) is perfect. 

In addition, if F < K < pcl{ F) and a G pcl( F) — K, the fact that a pk G 
F for some k > 1 implies that a is purely inseparable over F and hence 
also over K. But since a is not in K, it cannot be separable over K as 
well. Thus K is not perfect. I 



Exercises 

1. Let F < K < E. If F < E is separable, show that F < K and K < E 
are separable. 

2. Prove that if F < E is finite and separable then there are only 
finitely many intermediate fields between E and F. 

3. Show that all algebraically closed fields are perfect. If t is 
transcendental over F then F(t) is not perfect. 

4. Let a be algebraic over F, where expchar(F) = p and let d be the 
radical exponent of a. Show that c* p is separable over F if and 
only if k > d. 

5. Let p and q be distinct primes. Then Q < Q(^/p,^/q) is finite and 
separable and therefore simple. Describe an infinite class of 
primitive elements for this extension. Find the minimal 
polynomial for each primitive element. 

6. Let E = F(a 1 ,...,a n ) be separable over an infinite field F. Prove 

that there are an infinite number of n-tuples (aj,...^) G F n for 
which E = F(a 1 o 1 H h a n c* n ). 

7. Show that the class of purely inseparable extensions is closed 
under the taking of arbitrary composites. 

8. Let F < E. Define the purely inseparable closure of F in E and 
show that it is a field. 

9. If F < E is algebraic prove that any embedding <r.E— >E is uniquely 
determined by its restriction to F SC (E). 

10. Prove that if F < E is finite and expchar(F) = p then [E:F]j is a 
power of p. 

11. Show that lifting an extension by a purely inseparable extension 
does not affect the separable degree. That is, show that if F < E is 
algebraic and F < P is purely inseparable then [EP:P] S = [E:F] S . 

12. Let F < S be finite separable and F < P be finite purely 
inseparable. Prove that P < SP is separable and [SP:P] = [S:F]. In 
fact, if B is a basis for S over F, prove that it is also a basis for SP 
over P. 

Show that if F < E is finite and F < S is finite separable then 
[ES:S]i = [E:F] ; . 
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14. Let F < E be a finite extension and let a £ E be algebraic over F. 
Let H be the set of embeddings of E into E over F. The elements 
of H permute the roots of p(x) = min(a,F). Let /? be a root of 
p(x). Show that | {a E H | aa = /?} | = [E:F(a)] s . Hence, the 
multiset {aa | a E H} contains [E:F(a)] s copies of each root of 
p (x). 

15. Let F < E be a finite extension that is not separable. Show that 
for each n > 1 there exists a subfield E n of E for which E n < E is 
purely inseparable and [E:E n ]j = p n . 

16. Prove that if F ^ pcl( F) then the extension F < pcl( F) is infinite. 




Part 2 

Galois Theory 




Chapter 5 

Galois Theory I 



5.1 Galois Connections 

The traditional Galois correspondence between intermediate fields 
and subgroups of the Galois group is one of the main themes of this 
book. We choose to approach this theme through a more general 
concept, however. 

Definition Let P and Q be partially ordered sets. A Galois connection 
on the pair (P,Q) is a pair (II, fi) of maps II:P— >Q and Q:Q— >P, where 
we write II(p) = p* and fl(q) = q', with the following properties: 

1) (order reversing) For all p E P, q E Q, 

p < q => p* > q* and r < s => r' > s' 

2) For all p E P, q E Q, 

P < P* # an d q < q'* D 

Theorem 5.1.1 For any p G P and q G Q, we have 

1) p*'*=p*, 

2) q'*' = q'. 

Proof. Since p < p* ; , the order reversing property of * gives 

p*'*<p*<(p7* 

from which part 1) follows. Part 2) is similar. I 
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Corollary 5.1.2 The map p— >p*' is a closure operation on P, that is, if 
we denote p*' by c/(p), then for all p G P, q E Q, 



1) (Extensive) 

2) (Idempotent) 

3) (Isotone) 



P < c/(p) 
c/(c/(p)) = c/(p) 

P < q =» cl( p) < c/(q) 



Similarly, the map q— »q'* is a closure operation on Q. 0 



Definition An element p G P is said to be closed if c/(p) = p, and 
similarly for Q. We denote the set of all closed elements in P by C7( P), 
and similarly for Q. 0 



Theorem 5.1.3 The image of any element under II or Q is closed. In 
addition, the maps II and are order-reversing bijective inverse maps 
between the sets C/( P) and Cl( Q). 

Proof. Theorem 5.1.1 shows that the image of an element under II or D 
is closed. Moreover, if q G Cl( Q) is closed, then q' G C7(P) and Ilq' = 
q'* = c/(q) = q is in the image of II, and so II maps Cl{ P) onto C7(Q). If 
p, r G C7(P) and p* = r*, then p*' = r*', that is, p = r. Hence II is 
injective. A similar argument applies to 0. Finally, since p*' = p for 
p G C7(P), it follows that Q o n = i on C7( P) and similarly, n o Q = i on 
C/(Q). I 

Theorem 5.1.4 Let n:P— >Q and D:Q— ►P be a Galois connection, where 
P and Q are lattices. 

1) If pj G C7(P) and Ap x exists in P, then ApjG Cl( P). If P is a 
complete lattice then so is C7(P), with meet given by meet in P. 
Similar statements hold for Q. 

2) De Morgan’s Laws hold in C7( P) and C7( Q). That is, for p, 
q G C7(P) and r, s G C7(Q), we have 

(P A q)* = p* V q*, (p V q)* = p* A q* 

(r A s)' = r' V s', (rVs)' = r'As' 

Proof. For part 1), suppose that pj G C/(P) and A p^ exists as a meet in 
P. Since Apj<pj for all j, we have c/( A pj) < c/(pj) = pj, whence 
c/( A p^ < A pj. Since the reverse inequality holds as well, we have 
equality, whence Apj G C7(P). It follows from Theorem 0.1.1 that if P 
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is a complete lattice, so is C7(P), under meet in P. 

For part 2), observe first that pAq<p and pAq<q imply that 
(p A q)* > p* and (p A q)* > q*, whence (p A q)* > p* V q*. If r > p* and 
r > q* for r £ C7(P) then r' < p and r' < q, whence r' <pAq. Thus, 
r > (p A q)*. It follows by definition of join that (p A q)* = p* V q*. The 
other parts of De Morgan’s Laws are proved similarly. I 

Let Z + denote the set of positive integers. 

Definition We will say that a Galois connection (II, fi) on (P,Q) is 
indexed if the following hold. For each p, q £ P with p < q, there exists 
a number (q:p) GZ + U {oo}, called the degree of q over p. Similarly, for 
each r, s £ Q with r < s, there exists a number (s:r) £ Z + U {oo}, called 
the degree of s over r. Moreover, the following properties hold. 

1) (Degree is multiplicative) If s v s 2 , s 3 £ P or s v s 2 , s 3 £ Q then 

Sj < s 2 < S 3 => (s 3 : Sl ) = (s 3 :s 2 )(s 2 :s 1 ) 

2) (II and Q are degree-nonincreasing) If p, q £ P then 

p < q =» (p*=q*) < (q=p) 

If r, s £ Q then 

r < s => (r'is') < (s:r) 

3) If s, t £ P or s, t £ Q then 



(s:t) = 1 => s = t 



If (s:t) < oo, then s is said to be a finite extension of t. (We observe 
some obvious understandings about oo; for instance, n < oo for all n £ 
oo < oo, n • oo — oo for n £ Z"^" and oo < k < oo implies k — oo.) D 

From now on, when writing (p:q), it is with the tacit assumption 
that p < q. While II and Q are degree-nonincreasing in general, these 
maps are degree preserving when restricted to C7(P) and C/( Q), as we 
now show. 

Theorem 5.1.5 Let (II, Q) be an indexed Galois connection on (P,Q). 

1) If p, q £ C7(P) and p < q then (q:p) = (p*:q*). 

2) If p £ C7(P) and (q:p) < oo then q £ C7(P). In particular, if 0 is 
closed and (1:0) is finite then all elements are closed. 

Similar statements hold for Q. 
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Proof. If p E C/( P) then p = p*' and so 

(q:p) > (P*=q*) > (q* V') = (q*':p) = (q*':q)(q:p) 

If q E C/(P) then q*' = q, equality holds throughout and part 1) is 
proved. If (q:p) is finite then we may cancel to get (q*':q) = 1, which 
implies that q = q*' is closed. This proves part 2). I 

Thus, an indexed Galois connection induces a degree-preserving, 
order-reversing bijection between G/( P) and C7(Q). 



5.2 The Galois Correspondence 

Now we describe one of the most important Galois connections. 

Definition The Galois group of an extension F < E, denoted by Gp(E), 
is the group of all automorphisms of E over F. 0 

Note that when F < E is algebraic, Theorem 2.8.2 implies that 
G F (E) = //0mp(E,E). 

Let F < E and let 5 be the complete lattice of all intermediate fields, 
that is, fields K such that F < K < E, ordered by set inclusion. Let Q be 
the complete lattice of all subgroups of the Galois group Gp(E), ordered 
by set inclusion. We define two maps ILSF— and by 

n(K) = g k (e) 

and 

ft(H) = F( H) = {a E E | (jol = a for all a E H} 
where F(H) is the fixed field of H. 

Theorem 5.2.1 Let F < E. The pair of maps (II, Q) defined by 
n:Kh-*G K (E) and fi:Hh-*F(H) is a Galois connection. We refer to it as 
the Galois correspondence of F < E. 

Proof. It is clear from the definitions that 

K C J => Gj(E) C G k (E) 

and 

H C I => F(l) C F( H) 

Also, any element of K is fixed by every element of G K (E), that is, 
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K C .F(G k (E)) 

Finally, any a £ J fixes every element in F( J), that is, 

J Q J)(^) * 

Since 5 and Q are complete lattices, Theorem 5.1.4 gives 

Corollary 5.2.2 The set Cl( *$) of closed intermediate fields and the set 
C/(Q) of closed subgroups of Gp(E) are complete lattices, where meet is 
intersection. In particular, the intersection of closed intermediate fields 
is closed and the intersection of closed subgroups is closed. 0 

We would like to show that the Galois correspondence of an 
algebraic extension F < E is indexed, where (K:L) = [K:L] is the degree 
of the extension F < E and (H:J) is the index of the subgroup J in the 
group H. It is not hard to see that these degrees satisfy the first and 
third properties in the definition of an indexed correspondence. The 
next theorem shows that the map ILKi— >G K (E) is degree-nonincreasing. 

Theorem 5.2.3 Let F < E be algebraic and let F < L < K < E. Then 

(5.2.1) (G l (E):G k (E)) < [K:L] S < [K:L] 

1) If F < E is normal, then equality holds in the first inequality in 
(5.2.1) and the map ^’-G^E)— ►tfom L (K,E) defined by xfra = a | K 
induces a bijection 

~ Hom h ( K,E) 

2) If F < E is both normal and separable, then equality holds 
throughout (5.2.1). 

Proof. If cr, r G (? L (E), following are equivalent 

CT lK = r lK 

r _1 aa = a, for all a G K 
r -1 (r G Gk(I3) 

<t G t(? k (E) 

Thus, rpa = xj)T if and only if a and r lie in the same coset of G K (E). 
Hence ^ induces a bijection from the set of cosets of G K (E) in G L (E) 
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onto /m(V0* Since 

/m(V>) C #om L ( K,E) C Hom L ( K,E) 

we get 

(G l (E):G k (E)) = | Im(4 ,) | < | ffom L (K,E) | 

< | Hom h { K,E) | = [K:L] S 

which proves the first part of the theorem. 

To prove part 1), suppose that F < E is normal. Then L < E is also 
normal. If a £ K,E) then a can be extended to a £ #om L (E,E) = 

G l (E). It follows that <7 maps K into E, whence a £ Hom^ K,E) and so 
Hom^ K,E) = #om L ( K,E). Moreover, since any a £ Hom^ K,E) can be 
extended to if £ G L (E) and since a = tplr £ /m(^)» it follows that 
Im{xj)) = ^om L (K,E) and so equality holds in both inequalities in the 
previous display. This proves part 1). Part 2) is clear. I 

To show that the map is degree-nonincreasing, we require a 

preliminary result. 

Theorem 5.2.4 Let F < E. Let H C Gp(E). For a £ E, define a:H— »E by 
aa = aa (thus, a is evaluation at a). Then Oj,...,^ are linearly 
independent over F(H) if and only if aj,...,^ are linearly independent 
over E. 

Proof. Suppose that the cq’s are independent over E, and let £ ajcq = 0 
where aj £ F(H). Then for any cr £ H, 

0 = <r(£ a i<*i)= E a i(^«i) = 

Hence a i<*i = 0, implying that aj = 0 for all i. 

For the converse, suppose that <^,..., 0 ^ are independent over F( H) 
and let ^Xjcq^O on H. If this equation has a nonzero solution 
XiJ-'.jXn in E, consider a solution with the fewest number of nonzero 
entries and, by renumbering if necessary, assume the nonzero entries to 
be x v ...,x s . Dividing by x s if necessary, we may also assume that 
x s = 1. Thus 

(5.2.2) x 1 a 1 + --- + x s _ 1 a s _ 1 + a s = 0 
Equation (5.2.2) is equivalent to 

(5.2.3) x 1 («7a 1 ) + --- + x s _ 1 (<ra s _ 1 ) + <7a s = 0 
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for all a G H. In particular, for cr equal to the identity map, we get 

x 1 a 1 +--. + x s _ 1 a s _ 1 +a s = 0 

which implies, owing to the independence of the cq’s, that not all of the 
X;’s can lie in F(H). Let us assume that Xj 0 /'(H). 

Applying r € H to (5.2.3) gives 

r ( x l)( r <™i) + • • • + ’’( x s-i)( ro '« s -i) + r<ra s = 0 

for all <r E H. But as cr varies over the subgroup H so does rcr and so 

r ( x l)(«*i) + • • • + T(x s _i)(o-ar s _ 1 ) + cra s = 0 

for all <r,r E H, or equivalently, 

(5.2.4) (*■*!)«! + ’ • ' + (tXb-OS.-! + a s = 0 

Since Xj g /’(H), we may choose r G H such that r(xj) ^ x v Finally, 
subtracting (5.2.2) from (5.2.4) gives 

[(rxi) - xJSj + • • • + [(ra^) - x,_ 1 ]S._ 1 = 0 

which is shorter than (5.2.2). This contradiction completes the proof. | 

Now we can show that the map Hi— >E(H) is degree-nonincreasing. 

Theorem 5.2.5 Let F < E be algebraic and let H and J be subgroups of 
G f (E) with JCHC G F (E). Then 

[E(J):E(H)] < (H:J) 

Proof. If (H:J)=oo there is nothing to prove, so let (H:J) = r < oo. 
Choose one <r 1 from each coset of J in H, for i = l,...,r. Let 
<*V-iOi n e F(J) be linearly independent over F(H) and assume for the 
purposes of contradiction that n > r. The system 

x l(«l^l) + x 2 («2*l) + ' • ’ + *n(«n ff l) = 0 



x l(“l*r) + + ’ ’ • + x n (“nO = 0 

has more unknowns than equations and so it has a nonzero solution 
x l>---,x n in E. Hence, there exist P v ...,P n € E, not all 0, such that 
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(5.2.5) /?:(« i^j) + • • • + /? n (a n °'i) = 0 

for all i = 

Now, any r E H has the form r = a^p where p E J. Since c*j E F(J), 
we have poc j = c*j and so 

ojr = afap) = (<rjp)(atj) = ff i a j = 

Hence, it follows from (5.2.5) that 

/? 1 a 1 +-.- + ^ n a n = 0 

as a map on H. This contradicts the previous theorem, which says that 
the cq’s are independent over E. Hence n < r. I 

Thus, the Galois correspondence of an algebraic extension F < E is 
indexed. As a consequence, we have the following theorem. 

Theorem 5.2.6 Let F < E be algebraic and let (n,fl) be the Galois 
correspondence of F < E. Then (n,ft) is indexed. Hence 

1) n and Q are degree-nonincreasing, order-reversing maps. 

2) n and Q are degree-preserving, order-reversing bijections (inverses 
of each other) between the lattice C7(?) of closed intermediate 
Fields of F < E and the lattice C7(Q) of closed subgroups of the 
Galois group Gp(E). More specifically, 

a) IfF<L<K<E with K, L closed then 

[K:L] = (G l (E):G k (E)) 

b) If J C H C G f (E) with H, J closed then 

(H:J) = [F(3):F(R)] 

c) For K, L E C/(5) and H, J E C7(Q), we have 

<WE) = g k { E) V G L (E), g k VL (E) = Gk(e) n g l (e) 
F( H n J) = F( H) V F(J), F( H A J) = f(H) n F(3) 

In addition, any finite extension of a closed intermediate field or closed 
subgroup is closed. 0 

We should note that the joins in part 2c) of the previous theorem are 
joins in the corresponding lattices. Thus, for instance, Gj^(E) V G L (E) is 
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the smallest closed subgroup of Gp(E) containing G K (E) and G L (E), 
which need not be the smallest subgroup of Gp(E) containing these 
groups. (In other words, need not be a sublattice of and G/(Q) 

need not be a sublattice of Q.) 

Corollary 5.2.7 If F < E is finite then | Gp(E) | < oo. 0 



5.3 Who’s Closed? 

We turn our attention to the question of which intermediate fields of 
an extension and which subgroups of the Galois group are closed. 

Definition A normal separable extension is called a Galois extension. □ 

The next theorem follows from the relevant properties of normal and 
separable extensions. 

Theorem 5.3.1 

1) Let F < K < E. If F < E is Galois then K < E is Galois. 

2) The class of Galois extensions is closed under lifting: If F < E is 
Galois and F < K then K < EK is Galois. 

3) The class of Galois extensions is closed under arbitrary composites 
and intersections: If F < Ej are Galois and V Ej is defined then 
F < V Ej is Galois and F < f] Ej is Galois. 0 

It is not hard to describe the closed intermediate fields of an 
algebraic extension F < E. 

Theorem 5.3.2 Let F < E be algebraic and consider the Galois 
correspondence on F < E. 

1) An intermediate field K is closed if and only if K < E is a Galois 
extension. 

2) If K is closed and K < L < E then L is also closed. 

3) The following are equivalent. 

a) F is closed. 

b) F < E is a Galois extension. 

c) All intermediate fields are closed. 

Proof. According to Theorem 4.7.4, if K < E is normal then c/(K) = 
F(G k (E)) = K 1C , the purely inseparable closure of K in E. Hence, if 
K < E is Galois then c/(K) = K. For the converse, suppose that K is 
closed. Let a £ E with p(x) = min(a,K) of degree n. Since [K(c*):K] is 
finite, we know that K(a) is closed and 
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n = [K(q):K] = (G K (E):G K(a) (E)) 

Let . . . , a n be a complete set of coset representatives of G K(a)( E ) in 
G k (E). It is easy to see that r E ^^(^(E) ^ and only ^ T0L — a i a - 
Hence there are precisely n distinct images of a under the Galois group 
G k (E). But each of these images is a root of the minimal polynomial 
p(x) and so p(x) is separable with all of its roots in E. Hence K < E is 
both separable and normal. This proves statement 1) and shows that 
3a) and 3b) are equivalent. All of the statements in 3) are equivalent 
since F < E is Galois if and only if K < E is Galois for all intermediate 
fields K. Similarly, 2) follows from 1). I 

Note that if F < E is algebraic then E is closed since E = F((a)) 
where i 6 Gp(E) is the identity. 

If K is a closed intermediate field, then 

[E:K] = (G k (E):G e (E)) = | G K (E) | 

In the finite case, the converse also holds. 

Theorem 5.3.3 Let F < E be a finite extension. 

1) An intermediate field K is closed if and only if [E:K] = | G K (E) | . 

2) The following are equivalent, 
a) F < E is Galois. 

c) F is closed. 

d) All intermediate fields are closed. 

e) [E:K] = | C? K (E) | for all intermediate fields K. 

f) [E:F] = | G F (E) | . 

Proof. We have seen that K closed implies [E:K] = | G K (E) | . 
Conversely, if [E:K] = | G K (E) | then 

[E:K] = (G k (E):G e (E)) = [E(G e (E)):E(G k (E))] = [E:E(G K (E))] 

and so the finiteness of F < E implies that K = F(G K (E)), that is, K is 
closed. Part 2) follows from the previous theorem. I 

As for the matter of which subgroups are closed, let F < E be 
algebraic. Since the trivial subgroup G e (E) = (i) is closed, any finite 
subgroup of Gp(E) is closed. Thus, if F < E is finite then Gp( E) is finite 
and all subgroups are closed. We may now give a complete answer to 
the question of who’s closed in the finite case. 
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Theorem 5.3.4 If F < E is finite then all subgroups of the Galois group 
Gp(E) are closed and an intermediate field K is closed if and only if 
K < E is a Galois extension. In particular, if F < E is Galois then all 
intermediate fields are closed. 0 

As the next example shows, in the general algebraic case, not all 
subgroups need be closed. 

Example 5.3.1 For this example, we borrow from a later chapter the 
fact that for any prime power p d , there exists a finite field GF( p d ) of 
size p d and GF( p d ) < GF( p r ) if and only if d | r. 

Let F = Z p and let E = F be an algebraic closure of F. Since F is a 
finite field, it is perfect and so F < E is separable. Since E is 
algebraically closed, F < E is normal. Hence F < E is a Galois extension 
and therefore F is closed. Let H = (<7 p ) be the subgroup of Gp(E) 
generated by the Frobenius map *a p . The fixed field F(H) is the 

set of all a E E for which a p = a, m other words, the roots in E of the 
polynomial p(x) = x p — x. But p(x) has p roots in F and so F( H) = F. 
It follows that 



c/(H) = G m (E) = G f ( E) 

Hence, all we need do is show that H ^ Gp(E) to conclude that H is not 
closed. 

Let q be a prime and consider the field 

P = GF( p q ) U GF( p q2 ) U GF( p q3 ) U • • • 

Then P is a proper subfield of E, since it does not contain, for instance, 
the subfield GF(p q+1 ). Hence [E:P] > 1 and since P < E is Galois, the 
group G P (E) is not trivial. Let a E Gp(E). If a E H then a = <r p for 
some k and so 

F(((t)) = {a E E | <r p a = a} = {a E E | a p = a} 

is the set of roots in E of the polynomial x pk — x. Hence F((a)) is finite. 
But F((a)) contains the infinite set P. This contradiction implies that 
<7 ^ H and so H ^ G F (E). □ 

The Galois correspondence begins with a field extension F < E and 
the corresponding Galois group Gp(E). We may also begin with a field 
E and a group G of automorphisms of E. Then we can form the fixed 
field 



F( G) = {a E E | <ra = a for all a E G} 
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and consider the Galois correspondence (11,(2) on F( G) < E. Since G is a 
subgroup of G f , G \( E), it is in the domain of the map (2 and so F(G) = 
(2(G). Hence, F(G) is closed under the Galois correspondence and so 
F( G) < E is a Galois extension. If G is closed, which happens, for 
instance, when G is finite, then G F ^(E) = G. 

Theorem 5.3.5 Let E be a field and let G be a group of automorphisms 
of E. Then the extension F(G) < E is Galois. If G is closed (for 
example, if G is finite) then G = G F ^(E), □ 



5.4 Normal Subgroups and Normal Extensions 

If F < E is normal and K is an intermediate field, we know that 
K < E is also normal, but F < K need not be. However, we can neatly 
describe when F < K is normal in terms of Galois groups. (This is an 
example of the power and purpose of Galois theory.) 

Suppose first that F < K is normal (F < E need not be normal). 
Since any r E Gp(E) sends K onto itself, it follows that t~ 1 <tt E G k (E) 
for any a E G K (E), that is, G K (E) is a normal subgroup of Gp( E), in 
symbols, G K (E) < Gp(E). 

Conversely, suppose that G K (E) < Gp(E). We want to show that 
F < K is normal. Let a E K have minimal polynomial p(x) over F. If /? 
is any other root of p(x), then Theorem 2.8.4 implies the existence of a 
r E Homp(E,E) such that rot = /?. If F < E is normal, then r E Gp(E). 
If a E G k (E), the normality of G K (E) implies that ar = rcr' for some 
o' E G K (E) and so 

cr/3 — a rot — to 1 a = rot — (3 



Thus, cr fixes all of the roots of p(x) and so all of the roots of p(x) lie in 
F(G k (E)). If K is closed, then all of the roots of p(x) lie in K and so K 
is normal over F. We have proven most of the following. 

Theorem 5.4.1 Let F < K < E. 

1) If F < K is normal then G K (E) < Gp(E). 

2) If F < E is normal, K < E is Galois and G K (E) < Gp(E) then 
F < K is normal. 

3) If F < E is Galois then F < K is normal if and only if 
G k (E)<G f (E). 

Moreover, if F < K and F < E are normal, the map xl>:G F (E)--+G F (K) 
defined by 



rpCT-HJ I K 
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is an epimorphism whose kernel is G K (E). Thus, 



G f (K) ~ 



Gf(E) 
Gk( E) 



Proof. We need only prove the last statement. Let or £ G F (E). Since 
F < K is normal, the restriction cr | E , being an embedding of K into E 
over F, is an automorphism of K and thus lies in Gp(K). Hence maps 
G f (E) to Gp(K). Moreover, for <r, r £ Gp(E), we have 

(<rr) | K = <r(r | K ) = (<r | K )(r | K ) 



which shows that is a group homomorphism. The kernel of is 
G k (E) since if cr £ G F (E) then <j | K = i if and only if cr £ G K (E). 
Finally, the map x/j is surjective since the normality of F < E implies 
that any cr £ Gp(K) can be extended to an element of Gp(E), whose 
restriction to K is <r. I 



5.5 More on Galois Groups 

We now examine the behavior of Galois groups under lifting and 
under the taking of composites. We assume that all composites 
mentioned are defined. 

Theorem 5.5.1 (The Galois group of a lifting) Let F < E be Galois and 
let F < K. Then K < EK is Galois. Moreover, the restriction map 
^:G k (EK)— *G K nE (E) defined by xpcr = or | E is an isomorphism. Thus 



G K (EK)~G KnE (E) 

In addition, 

1) K O E = F implies G k (EK) - G F (E). 

2) If F < E is finite, then G k (EK) ~ Gp(E) implies K 0 E = F. 

Proof. We have already seen that K < EK is Galois. The normality of 
F < E implies that t/> is a homomorphism from G k (EK) into G KnE (E). 
If cr £ G k (EK) and cr | E = i then cr fixes E as well as K and so it fixes 
all elements of EK, whence cr = l Thus xj; is injective. It remains to 
show that Imxp — ^KnE^)* 

To avoid confusion, let us use the notation F E ( * ) for the fixed field 
with respect to the Galois correspondence on F < E, and F EK ( • ) for the 
fixed field with respect to the Galois correspondence on K < EK. Since 
K < EK is Galois, we deduce that K is closed with respect to the Galois 
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correspondence on K < EK and so 

F E (/m VO = {<* £ E | ra = a for all r E /m(V > )} 

= {ot G E | (cr | E )a = a for all a E G k (EK)} 

= {a E E | era = a for all a E G k (EK)} 
-En^ EK (G K (EK)) 

= EflK 

Now, if we show that Im V> is closed with respect to the Galois 
correspondence on F < E, it follows by taking Galois groups that 



Imip — G Kn e (E) 



and thus %!> is surjective, completing the proof. If F < E is finite, then all 
subgroups of the Galois group G e (E) are closed, and we are finished. 
We will postpone the proof in the infinite case until we have discussed 
the Krull topology, later in this chapter. 

Finally, statement 1) is clear. As to statement 2), we have 
G e (E) G KnE (E) and since G KnE (E) < G F (E) with both finite, we 
deduce that G E nE (E) = Gp(E), whence KflE = F follows by taking 
fixed fields. I 

Theorem 5.5.1 yields a plethora of useful statements about degrees, 
all of which can be read from Figure 5.5.1. We leave details of the proof 
to the reader. [Part 3) of the next result is particularly useful.] 




Figure 5.5.1 

Corollary 5.5.2 Suppose that F < E is finite Galois and F < K, with EK 
defined. Then 

1) [EK:K] = [E:E fl K] and [EK:K] | [E:F]. 
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If F < K is also finite then 

2) [EK:F] = [E:EflK][K:F] and [EK:F] | [E:F][K:F]. 

3) [EK:F] = [E:F][K:F] if and only if E n K = F. 

More generally, if F < Ej is finite Galois for i = 1, . . . ,n — 1 and F < E n 
is finite then 

4) [E 1 ...E n :F]=ft[E i :E i n(E i+1 -E n )] 

i=l 

where E i+1 - • -E n = F when i = n. 

5) [Ej- • -E n :F] = f[ [E S :F] if and only if E; D (E i+1 - • -EJ = F for all i, 

i=l 

where E i+1 - • -E n = F when i = n. 0 

We now turn to the Galois group of a composite. 

Theorem 5.5.3 (The Galois group of a composite) Let <5 = {Ej | i E 1} 
be a family of fields, all contained in a larger field. If F < Ej is Galois 
over F for all i E I, then the composite V Ej is Galois over F. If G = 
Gp(Ej) is the direct product of the Galois groups Gp(Ej) and if 
►G f p(E i ) is projection onto the i-th coordinate, then the map 



^(VEjHnGtfEi) 

defined by 

»i W”) = a I E. 

is a monomorphism of groups. Hence, Gp( V E|) is isomorphic to a 
subgroup of the direct product f] Gp(Ej). 

Moreover, if *3F = {E 1 ,...,E n } is a finite family of finite extensions, 
then the following are equivalent 
1) ^ i s an isomorphism and 



Gp{ Ej v • • • V E n ) ~ G f ( E,) X ... x Gp(E n ) 

2) E| fl (E i+1 - • -E n ) = F for all i = 1, . . . , n. 

Proof. Let K = V E|. We have already seen that F < K is Galois. Let 
cr E Gp(K). Since each F < Ej is normal, we have cr | E E G F (E i ). If r E 
Gp(K) then 1 

*i(W<7T)) = (<rr) | E . = (<r | E .)(r | E .) = 7r i (V'<r)7r i (V'r) = ir i [(V'<r)(V’r)] 

and so t fi(<rr) = (^cr)(i/>r). Thus, rp is a homomorphism of groups. If 
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cr | £ . = t for all i E I, then since each element of K is a rational function 
(over F) in finitely many elements of [JE we must have cr = a, whence 
ip is injective. 

When is a finite family of finite extensions, all Galois groups are 
finite. It follows that 

\IrmP\ = IGpfVEi)! = [VE i: F] 

and 



I n Gf(Ej) I = niGp(Ei)| =niEi:F] 



Hence ip is surjective if and only if [ V Ej:F] = [E^F] and Corollary 

5.5.2 gives the desired result. I 

The following corollary will prove useful. 

Corollary 5.5.4 Suppose that F < E is a finite Galois extension with 
Galois group of the form 



If 



G = G F (E) = G x x • • • x G n 
Hj = Gj x • • • x {4 x • • • x G n 



where {*} is in the i-th coordinate and if Ej = F( Hj) then 

1) F < Ej is Galois with Galois group Gp(Ej) ~ G|, 

2) E = E x V • V E n , 

3) Ej fl (E i+1 * • -E n ) = F for all i = l,...,n. 

Proof. Since F < E is finite and Galois, all intermediate fields and all 
subgroups of G are closed. Since Hj«G, it follows from Theorem 5.4.1 
that F < E; is a Galois extension and 



~ g£(E) “ Hi - G i 



In addition, F < V Ej is Galois. Since 

Gve/e) = n g e .( e) = riHj = w = g e (e) 

taking fixed fields gives V Ej = E. Hence, Gp( V Ej) ~ Gp(Ej) and 
Theorem 5.5.3 implies that Ej fl (E i+1 - • -E n ) = F for all i = 1, . . . , n. I 
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Abelian and Cyclic Extensions 

Extensions are often named after their Galois groups. Here is a very 
important example. 

Definition An extension F < E is abelian if it is Galois and if the Galois 
group Gp(E) is abelian. An extension F < E is cyclic if it is Galois and 
if the Galois group Gp(E) is cyclic. 0 

The basic properties of abelian and cyclic extensions are given in the 
next theorem, whose proof is left as an exercise. 

Theorem 5.5.5 

1) If F < E and F < K are abelian, then F < EK is abelian. 

2) If F < E is abelian (cyclic) and F < K, then K < EK is abelian 
(cyclic). 

3) If F < K < E with F < E abelian (cyclic), then F < K and K < E 
are abelian (cyclic). D 



*5.6 Linear Disjointness 

If F < K and F < L are finite extensions, the degree [KL:F] provides 
a certain measure of the “independence” of the extensions. Assuming 
that [K:F] < [L:F], we have 

[L:F] < [KL:F] < [L:F][K:F] 

The “least” amount of independence occurs when [KL:F] = [L:F], or 
equivalently, when K < L and the “greatest” amount of independence 
occurs when 

(5.6.1) [KL:F] = [K:F][L:F] 

We have seen (Corollary 5.5.2) that, if one of the extensions is Galois, 
then (5.6.1) holds if and only if KflL = F. For finite extensions in 
general, we cannot make such a simple statement. However, we can 
express (5.6.1) in a variety of useful ways. For instance, (5.6.1) holds 
for arbitrary finite extensions if and only if whenever {a?.} C K is 
linearly independent over F and {/3j} C L is independent over F then 
{ a? ♦ /?j } is also independent over F. 

To explore the situation more fully (and for not necessarily finite 
extensions), it is convenient to employ tensor products. (All that is 
needed about tensor products is contained in Chapter 0.) 
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Let F < K and F < L. The multiplication map cr: K x L—*KL defined 
by <7(c*,/?) = a/? is bilinear and so there exists a unique linear map 
xp:K 0 L— >KL for which xp(a (8) /3) = a/?. (The tensor product is over F.) 
This map is a morphism of F-algebras, since 

xp [( ol (8) /?)(t 0 (5)] = xp(ay 0 /36) = c*7/?5 = (a/?)(7<5) = ® /?)V>(7 0 $) 

Note that the image of xp is the F-algebra K[L] = L[K] of all elements 
of the form 

Ml + - + Mn 

for kj G K and G L. Hence, if F < K or F < L is algebraic, then KL = 
K[L] and so the map xp is surjective. 

If F is a field, we use the term F-independent to mean linearly 
independent over F. 

Theorem 5.6.1 Let F < E and suppose that K and L are intermediate 
fields. The following are equivalent. 

1) The linear map xp defined above is injective. 

2) If {a-} C K is F-independent then it is also L-independent. 

3) If {aj C K and {/?j} C L are both F-independent then {cq/Jj} is 
also F-independent. 

4) If {ofj} is a basis for K over F and {/?j} is a basis for L over F 

then is a basis for K[L] over F. 

5) There is a basis for K over F that is L-independent. 

If F < K and F < L are finite, then each of 1) to 5) is equivalent to 

6) [KL:F] = [K:F] [L:F], 

If F < K and F < L are finite and one is Galois, then each of 1) to 6) is 
equivalent to 

7) K fl L = F. 

Proof. [l=>-2] Let {aj C K be F-independent and suppose that 
^2 = 0 for G L. Since xp is a monomorphism and 

*l>( 12 ® <*0 = £ = o 

we have 

J2 ® <*i = 0 

Theorem 0.8.2 now implies that = 0 for all i. 

[2=^3] Let {aj and {/?j} be F-independent. If 
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E a iM = o 

ij 

with a- E F then since {aj is also L-independent, the coefficients of a- 
must equal 0, that is, 

EVi=° 

j 

for all i. Since the /?j’s are also F-independent, we get ay = 0 for all i, j. 

[3=>4] This follows from the fact that if {aj spans K over F and 
{/ ?•} spans L over F then {cq/?j} spans K[L] over F. 

[4=3-1] The map ^ sends a basis {a ; 0 /?j} for K 0 L to a basis {c^/Jj} 
for K[L] and is therefore injective. 

Thus, each of 1) to 4) is equivalent, and by symmetry we may add 
the equivalent statement that any F-independent subset of L is also K- 
independent. It is clear that 2) implies 5). 

[5=3-2] Let {oj} be a basis for K over F that is L-independent. Let 
{/?j} be an F-independent subset of L. We show that {/ 3 j) is also K- 
independent. Let Si /C ii®i == ® where /Cj E K. Then k- x = J^a-jO*, where 
ay G F, and so 

EEVi°i =0 

1 j 

But the cq’s are L-independent and so 

E a i/j = ° 

j 

for all i. Hence ay = 0 for all i, j. It follows that = 0 for all i, whence 
{/?j} is K-independent. 

[1<36] In the finite (hence algebraic) case, we have remarked that the 
map t/r.K 0 L— >KL is surjective and so it is also injective if and only if 
dim K 0 L = dim KL, which by Corollary 0.8.5 is equivalent to 

(dim K )(dim L) = dim KL 

all dimensions being over F. 

[6<37] This follows from Corollary 5.5.2. I 

Definition If any of the equivalent conditions hold in Theorem 5.6.1, we 
say that K and L are linearly disjoint over F. □ 
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*5.7 The Krull Topology 

We have seen that if F<E is a finite Galois extension, then all 
subgroups of the Galois group Gp(E) are closed but if F < E is infinite 
and Galois, this need not be true (see Example 5.3.1). The use of the 
term closed suggests the presence of a topology, which we now define. 

Definition Let E e be the set of all functions from E into E. We define a 
topology on E E , called the finite topology, by specifying as subbasis 
all sets of the form 



S U)V = {f:E—*E | fu = v} 

where u, v G E. A basis for ¥ thus consists of all sets of the form 
{f:E-»E | fu* = v 2 , . . . , fu k = v k } 



where u i? Vj G E. 0 



Of course, if F < E, then the Galois group Gp(E) is a subset of E E . 

Theorem 5.7.1 If F < E is algebraic then Gp(E) is closed in the finite 
topology. 

Proof. We show that any f G E e that lies in the closure c/(Gp(E)) of the 
Galois group is actually in Gp(E). A basic open neighborhood of f has 
the form 

{g€E E |gu 1 =fu 1 ,...,gu k = fu k } 

and so f G c/(Gp(E)) implies that for any Uj,...,^ G E there is a a G 
Gp(E) for which <ruj = fuj for i = l,...,k. It follows that f is a 
homomorphism. For if u, v G E and a, /? G F then there is a a G Gp(E) 
for which 



<TU = fu, (TV = fv, 

<r(au + /?v) = f(c*u + /?v), cr(uv) = f(uv) 

Hence, 

f(au + /?v) = <r(au + /?v) = aau + /3<rv = afu + /?fv 

and 

f(uv) = cr(uv) = (<tu)(ctv) = (fu)(fv) 

which shows that f is a homomorphism. Also, fu = 0 implies <ju = 0 for 
some a G Gp(E) and so u = 0, showing that f is injective. Similarly, f 
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fixes F pointwise. Thus, f is an embedding of E into itself over F. Since 
F < E is algebraic, we deduce that f E Gp(E). I 

Thus, if F < E is a Galois extension, the Galois group Gp(E) is 
closed in the finite topology on E E . The subspace topology inherited by 
Gp(E) is called the Krull topology on Gp(E). It follows that a subset of 
Gp(E) is closed in the Krull topology if and only if it is closed in the 
finite topology on E e . 

To avoid any temporary confusion, we refer to a subset of Gp(E) 
that is closed in the Krull topology as k-closed and a subgroup of Gp(E) 
that is closed in the sense of the Galois correspondence as g-closed. 
Similarly, we use the term k-open for open sets in the Krull topology. 

Let us determine the closure H in the Krull topology of a subgroup H 
of Gp(E). If r E H then given u 1 ,...,u n E E, there is a cr E H for which 
TU| = (TUj, for i = l,...,n. This implies that r fixes any element of the 
fixed field F( H). Hence, r E H if and only if, given u 1 ,...,u n E E, there 
is a <7 E H for which 

r l/'(H)(u 1 ) ...,u n ) = <r lF(H)(u 1 ,...,u n ) 

Since any finite extension of F( H) contained in E has the form 
F(H)(u 1 , . . . ,u n ), we can say that r E H if and only if for any finite 
extension K of F( H) contained in E, there exists a cr E H for which 
7 I K = a I K- 

If F( H) < K is a finite extension and K nc is the normal closure then 
F(H) < K nc is a finite Galois extension. Thus r E H if and only if for 
any finite Galois extension K of F(H) contained in E, there exists a a E 
H for which r | K = a | K . Finally, letting 

H|k = {*Ik:*€H} 

we can say that r E H if and only if for any finite Galois extension K of 
F( H) contained in E, we have r | K E H | K . 

If r E H and K = F( H), we have 

7 I F(H) € H I F(H) = W 
and so r E Gp^( E), the g-closure of H, whence 

H C Gp^ H j(E) 

To see that the reverse inclusion holds, suppose that r E G F ^( E) and 
let F( H) < K be a finite Galois extension contained in E. Since F( H) is 
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contained in K, we have 

F( H) = {a E E | aot = a for all a E H} 

= {c* E K | <ra = a for all a E H} 

= {a E K | aa = a for all a E H | K } 

= m\ K ) 

where F( H | K ) is the fixed field of H | K with respect to the Galois 
correspondence on the Galois extension F(H) < K. (Note that since 
F(H) < K is a Galois extension, if a E H then a is an automorphism of 
E over F( H), whence its restriction a | K is an automorphism of K over 
F( H). Hence, H | K is contained in the Galois group 

Since F( H) = F( H | K ), the extension F(H | K ) < k is finite and 

Galois, implying that H | ^ is g-closed in the Galois correspondence of 

F( H | K ) < K. Hence, 

r I K G G F(H)(K) = G F(h | k )( K ) = H I K 

and so r G H. It follows that C H. Let us summarize. 

Theorem 5.7.2 Let F < E be a Galois extension and let H be a subgroup 
of the Galois group Gp(E). Then the closure G^ H j(E) of H with respect 
to the Galois correspondence on F < E is the closure of H in the Krull 
topology. □ 

Let F < E be a Galois extension. We leave it to the reader to show 
that the composition map 

Gp(E) x Gp(E)h->Gp(E):((j,r)H-xjr 

and the inversion map 



G f (E)i — >G p(E):cn — ht ^ 

are continuous under the Krull topology. Hence, Gp(E) is a topological 
group. In fact, it can be shown that Gp(E) is a compact, totally 
disconnected topological group. 

We conclude this section by completing the proof of Theorem 5.5.1 
in the infinite case. Recall that F < E is Galois and F < K. The map 
V>:G k (EK)— >G p(E) is defined by = a | E and we wish to show that 
Im rp is closed with respect to the Galois correspondence on F < E. 
Theorem 5.7.2 implies that this is equivalent to showing that I = Im xp 
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is closed in the Krull topology on Gp( E). 

Let r E I, the Krull-closure of I. We show that r E I by finding a a e 
G k (EK) for which a | E = r. Let us define cr:EK-->EK as follows. Since 
K < EK is algebraic, any element a e EK is a finite sum of the form 

<*= I> i k i 

where ej E E and kj E K. We set 

aa= £( re i)ki 

The first order of business is to show that this is well-defined. 

To this end, note that since r E I, it follows that for any finite set 
U = {u 1 ,...,u n } C E, there exists a <Tjj E G k (EK) that agrees with r on 
the elements of U, that is, for which 

ajjUi = TU|, for all i 

Hence, if U = {e 1 ,...,e n } then 

0-0!= E^e^k; 

Now suppose that a can also be written as 

“= Ee-k| 

Let V = {ej} C E and let u-\j y y agree with r on U U V. Then 

E ( re i) k i = EKu V e i) k i = (r Uuv(E e i k j) 

= IT Uu v(E e-k() = EKu v e i) k i = E ( re O k i 

Thus, the definition of aa does not depend on the representation of a , 
and a is well-defined. 

Now suppose that 

«1 = Ee^i, a 2 = E e 2i k i ,•••,«„= E e ni k i 

is any finite set of elements of EK and let U = {e-}. If a 1 E G k (EK) 
agrees with r on the elements of U, then 

j = <r ' E e ji k i = £ r ( e ji) k i = a E«ji k i = 

i i i 
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for all j = In other words, for any finite subset S of EK, there is 

an element of G k (EK) that agrees with g on S. 

It follows that g is a homomorphism of EK, for if a, /? £ EK then 
there exists a, a' £ G k (EK) that agrees with a on a, /?, a + /? and a/3 
and since g' is a homomorphism, we have 

a(a + /?) = a\a + /?) = ct'ol + a '/3 = gol + a/3 

and 

cr(a/3) = <r'{a(f) = (a'a){<r'P) = (<ra)(<7/?) 

It also follows that a is injective, for if gol = 0 then there is a g' £ 
G k (EK) such that g’ql = 0, whence ot = 0. The surjectivity of a follows 
from that of r, since if a £ EK, then 

«= £ e iki= E ( Te 0 k i = 

Finally, it is clear from the definition that gol = a for all a £ K and 
that gol — ra for all a £ E. Thus, g £ G k (EK) and g | E = r. This 
completes the proof of Theorem 5.5.1. 

Exercises 

1. If A is an order reversing bijection between two lattices, 
verify that A(a A b) = Aa V Ab and A(a V b) = Aa A Ab. 

2. With respect to a Galois connection, if P is a complete lattice then 
C/(P) is also a complete lattice. 

3. If K < E and L < E are Galois extensions, show that K fl L < E is 
a Galois extension. 

4. Let K and L be subfields of a field E and suppose that K < E and 
L < E are Galois, with Galois groups G 1 and G 2 , respectively. Let 
G 1 G 2 be the join of G x and G 2 in the lattice Q of all subgroups of 

nL (E) and let G x V G 2 be the join of G x and G 2 in the lattice 
Q of all closed subgroups of G K nL (E). Show that G 1 G 2 is finite if 
and only if G K n l (E) is finite, in which case GiG 2 = G x V G 2 . 

5. Let F < E be finite with G = G F (E). Let G x «G 2 < G, with Fj = 
F(G .). Show that G F (Fj) - G 2 /G r 

6. Find an example of an infinite algebraic extension whose Galois 
group is finite. 

7. Prove Corollary 5.5.2. 

8. Let F be a perfect field. Suppose that there is a prime p for which 
p | [E:F] for every proper finite extension E of F. Show that if E is 
a finite extension of F then [E:F] = p n for some n £ N. Apply this 
to the case F = R to deduce that if R < E is a finite extension the 
[E:R] = 2 n for some n £ N. 
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9. Let F < E be a finite Galois extension and let E < K. Then [EK:K] 
divides [E:F]. Use the following to show that the assumption that 
F < E be Galois is essential. Let a be the real cube root of 2, let 
w / 1 be a cube root of 1. Let F = Q, E = Q(au>) and K = Q(a). 

10. Prove the following statements about abelian and cyclic 
extensions. 

a) If F < E and F < K are abelian, then F < EK is abelian. 

b) If F < E is abelian (cyclic) and F < K, then K < EK is abelian 
(cyclic). 

c) If F < K < E with F < E abelian (cyclic), then K < E and 
F < K are abelian (cyclic). 

11. Let F < E and F < K be extensions, with E and K contained in a 
larger field. Show that E and K are linearly disjoint over F if and 
only if E' and K' are linearly disjoint over F for all intermediate 
fields F < E' < E and F < K' < K with [E':F] and [K':F] finite. 

12. Let F < E be a normal extension. Show that the separable closure 
F sc of F in E and the purely inseparable closure F lc of F in E are 
linearly disjoint over F. Moreover, if F < K < E and if K and F 1C 
are linearly disjoint over F then F < E is separable. 

13. Let f(x) G F[x] and let F < E. Let S E be the splitting field of f(x) 
over E. Thus, if Of 1? . - - , Qf n are the roots of f(x) in S E , we have 
S E = E(a 1 ,...,a n ). Let S F = F(a 1 ,...,a n ) and let L = 
S F fl F(G e (S e )). Let ^:G E (S E )->G L (Sp) be defined by ipa = <j \ s ^. 
Show that is an isomorphism. This is known as the Theorem on 
Natural Irrationalities . 

14. Referring to Theorem 5.5.3, show that if is an arbitrary family 
then the map if) is an isomorphism if 

E-nf V E;) = F for all j G I 
J V i*j J 

15. Extend the notion of closure obtained from the Galois extension to 
all subsets of Gp(E), and show that it is a closure operation in the 
sense of topology. 

16. Prove that Gp(E) is a topological group under the Krull topology. 
Show that this topological group is totally disconnected. 

17. Let F < E and suppose that S is a finite set of elements 
algebraically independent over E. Then F(S) and E are linearly 
disjoint over F. 

18. a) Show that in every Galois extension F < E, there is a largest 

abelian subextension F ab , that is, F < F ab < E, F < F ab is 

abelian and if F < K < E with F < K abelian then K < F ab . 
b) If G is a group, the subgroup G' generated by all 

commutators a/fa^ 1 /? -1 , for a, /? G G, is called the 

commutator subgroup. Show that G' is the smallest subgroup 
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of G for which G/G # is abelian. 

c) If the commutator subgroup G F (E) / of a Galois group Gp(E) 
is closed, that is, if (? F (E)' = G K (E) for some F < K < E, then 
K = F ab . 

19. Let F < K and let F < E < L. Assume that K and L are contained 
in a larger field. Then K and L are linearly disjoint over F if and 
only if K and E are linearly disjoint over F and KE and L are 
linearly disjoint over E. 

20. The following concept is analogous to, but weaker than, that of 
linear disjointness. Let F < K and F < L be extensions, with K 
and L contained in a larger field. We say that K is free from L 
over F if whenever SCK is a finite set of algebraically 
independent elements over F, then S is also algebraically 
independent over L. 

a) The definition given above is not symmetric, but the concept 
is. Show that if K is free from L over F, then [KL:L] t = [K:F] t . 
Let T be a finite F-algebraically independent set of elements 
of L. Show that T is algebraically independent over K. 

b) Let F < K and F < E be field extensions, contained in a larger 
field. Prove that if K and L are linearly disjoint over F, then 
they are also free over F. 

c) Find an example showing that the converse of part b) does 
not hold. 




Chapter 6 

Galois Theoiy II 



In this chapter, we pass from the highly theoretical material of the 
previous chapter to the somewhat more concrete, where we consider the 
Galois groups of the splitting fields of specific types of polynomials. 



6.1 The Galois Group of a Polynomial 

The Galois group of a polynomial p(x) E F[x] is defined to be the 
Galois group of a splitting field S for p(x) over F. This group is 
sometimes denoted by Gp(p(x)). If 

p( x ) = PiK*)- • -Pk k W 

is a factorization of p(x) into powers of distinct irreducible polynomials 
over F, then S is also a splitting field for the polynomial q(x) = 
p 1 (x)-*-p k (x). Moreover, the extension F < S is separable (and hence 
Galois) if and only if each pj(x) is a separable polynomial. In particular, 
if p(x) has no multiple roots, then F < S is a Galois extension. 

Note that each a E Gp(S) is uniquely determined by its action on the 
roots of p(x), which generate S, and that this action is a permutation of 
the roots. However, not all permutations of the roots of p(x) need 
correspond to an element of Gp(S). Thus, we have an injective group 
homomorphism from Gp(S) into the symmetric group S n , where n = 
deg p(x). 

Let p(x) = f(x)g(x) where deg f(x) > 0 and let S p be the splitting 
field for p(x) over F and the splitting field for f(x) over F. We clearly 
have F<S f <S p with each step normal. Hence, by Theorem 5.4.1, 
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G Sf (S p )<.G F (S p )and 



or, in another notation, 



Gp(S f ) * 



G f ( S p ) 

G S p p ) 



G F (f(x)) ~ 



G f (p(*)) 

<?s f (p( x )) 



Thus, the Galois group of a nontrivial factor of p(x) is isomorphic to a 
quotient group of the Galois group of p(x). 



6.2 Symmetric Polynomials 

If F is a field and t 1 ,...,t n are algebraically independent over F, the 
polynomial 

g( x ) = ft( x -ti) 

i=l 

is referred to as a generic polynomial over F of degree n. Since the roots 
t 1? . . . , t n of the generic polynomial g(x) are algebraically independent, 
this polynomial is, in some sense, the most general polynomial possible. 
Accordingly, it should (and does) have the most general Galois group, 
as we will see. 

The generic polynomial can be written in the form 

g(x) = X n - S x x n-1 + h (-l) n s n 

where the coefficients s k E F(t 1 ,...,t n ) are given by 

n 

s l =t 1 + -.- + t nl S 2 = S n = IJtj 

i < j i=l 

and are called the elementary symmetric polynomials in the variables tj. 
It follows that the coefficients of any polynomial are the elementary 
symmetric functions of the roots (in a splitting field) of that 
polynomial. 

Since F(t 1 ,...,t n ) is the splitting field for g(x) over F(s 1 ,...,s n ), and 
since g(x) has no multiple roots, we deduce from the remarks of the 
previous section that the extension F(s 1 ,. . .,s n ) < F(t x ,. . .,t n ) is Galois 
of degree at most n!. Moreover, any permutation a E S n of {l,...,n} 
induces a unique automorphism of F(t 1 ,...,t n ) defined by 
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pQ^...,^) POvti)!-- •»<>(„)) 

^O'l’ • • • 1 1„) qCto-fi)* — » t CT-(n)) 

Let us denote the group of all such automorphisms by G. 

According to Theorem 5.3.5, since G is a finite group of 
automorphisms of F(t x , — , t n ), the extension F(G) < F(t 1? ...,t n ) is 
finite and Galois, with Galois group G and so 

[F(t v ...,t n ):F(G)]= I G | = | S n | = n! 

Since every elementary symmetric function is fixed by the elements of 
G (hence the name symmetric function), we have 

F( Sl ,..., Sn )<F(G)<F(t 1 ,...,t n ) 

and since 

[F(ti,...,t n ):F(s 11 ...,s n )] <n! 
we have equality above and F( G) = F(s 1? . . .,s n ). 

Theorem 6.2.1 Let tj,...,t n be algebraically independent over F and let 
s 1? . . . , s n be the elementary symmetric functions in t 1? . . . , t n . 

1) F(s 1 ,. ..,s n ) < F(t 1 ,.. ,,t n ) is a Galois extension of degree n!, 
whose Galois group is isomorphic to the symmetric group S n . 

2) The generic polynomial g(x) is irreducible over F[s 1} . . .,sj. 

Proof. To prove part 2), observe that if g(x) = a(x)b(x) where 
deg a(x) = d > 0 and deg b(x) = e > 0, then the Galois group of g(x) 
would have size at most d!e! < (d -fe)! = n!. Hence g(x) is irreducible. I 

Definition A polynomial p(t x ,. . . , t n ) G F[tj,...,t n ] is symmetric if 
P(Ml)’ ^(n)) = P( 4 l» • • • > ^n) 

for all permutations a G S n . Equivalently, p is symmetric if 
fffoftl. — .U] =P(ti,...,t n ) 



for all <T G G. D 

Thus, a polynomial p(t 1 ,...,t n )GF(t 1 ,...,t n ) is symmetric if and 
only if it lies in the fixed field F(s 1 ,...,s n ), that is, if and only if it is a 
rational function in Sj,...,s n . However, we can improve considerably 
upon this statement. 
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Theorem 6.2.2 Let t-j_ , . . . , t n be algebraically independent over F and let 
s 1 ,...,s n be the elementary symmetric functions in tj,...,t n . A 
polynomial p(t 1 ,...,t n ) is symmetric if and only if there 

exists a polynomial q(x 1 ,...,x n ) over F for which p(t 1? ...,t n ) = 
q(s 1 ,...,s n ). Moreover, if p(t l9 . - - i t n ) has integer coefficients, then 
q(x 1 ,...,x n ) can be chosen with integer coefficients. 

Proof. If p(t 1 ,...,t n ) has the form q(s 1 ,...,s n ), then it is clearly 
symmetric. For the converse, the proof consists of a procedure that can 
be used to construct the polynomial q(x 1 ,. . .,x n ). Unfortunately, while 
the procedure is quite straightforward, it is recursive in nature and not 
at all practical. 

We use induction on n. The theorem is true for n = 1, since s 1 = t v 
Assume the theorem is true for any number of variables less than n and 
let p(t 1 ,...,t n ) be symmetric. By collecting powers of t n , we can write 

p( 4 u- ••.*!») = Po + P^n + P2 t n + ’ ‘ ’ + Pn^ 

where each pj is a polynomial in tj, . . . , t n—1 . Since p is symmetric in 
ti,...,t n _i and tj,...,^ are independent, each of the coefficients pj is 
symmetric in t l9 . . . , t n— j. By the inductive hypothesis, we may express 
each pj as a polynomial in the elementary symmetric functions on 
t v . . . , t n _ r If these functions are denoted by u v . . . , u n __ x , then we have 

(6.2.1) = q 0 + qi t n + q 2 t2+... + q n t" 

where each qj is a polynomial in u 1 ,...,u n __ 1 , with integer coefficients if 
p has integer coefficients. 

Note that the symmetric functions Sj can be expressed in terms of the 
symmetric functions Uj as follows 

S 1 = u l+ t n 
s 2 = u 2 + Ul t n 

( 6 . 2 . 2 ) ; 

s n-l ” u n-l + u n— 2^n 
s n = Un-ltn 

These expressions can be solved for the Uj’s in terms of the Sj’s, giving 
U 1 = ®1 ~ 

u 2 = s 2 -u 1 t n = s 2 -s 1 t n + t2 




6 Galois Theory II 



131 



u 3 = s 3 ~ u 2 4 n = s 3 - “ *n 

u n-l = s n— 1 - u n-2 t n = s n-l “ ^-2^ + ' ' ‘ + (- 1 )"" 1 *!}- 1 

and from the last equation in (6.2.2), 

(6.2.3) 0 = s n — u n _ x t n = s n - + • • • + (-l) n t" 

Substituting these expressions for the Uj’s into (6.2.1) gives 

P(ti> • • • » t n ) = r 0 + r x t n + r 2 t£ + • • • + r n t" 

where each rj is a polynomial in s 1 ,...,s n _ 1 and t n , with integer 
coefficients if p has integer coefficients. Again, we may gather together 
powers of t n , to get 

P( t l>-..,t n )=g 0 + glt n + g 2 tn + --- + gm t n 

where each gj is a polynomial in s 1 ,...,s n _ 1 , with integer coefficients if 
p has integer coefficients. If m > n, we may reduce the degree in t n by 
using (6.2.3), which also introduces the term s n . Hence, 

(6.2.4) p(t 1 ,...,t n ) = h 0 + h 1 t n + h 2 t2 + ... + h n _ 1 tj;- 1 

where each hj is a polynomial in s 1 ,...,s n , with integer coefficients if p 
has integer coefficients. 

Since the left side of (6.2.4) is symmetric in the tj’s, we may replace 
t n by t|, for each i = 1, . . . , n — 1, to get 

P(*>V • • • » tn) = ^0 + Mi + M? H ^ 

valid for all i = l,...,n. Hence, the polynomial 

P(x) = h 0 + h x x + h 2 x 2 + • • • + h n _ x x n-1 - p(t x , . . . , t n ) 

has degree (in x) at most n — 1 but has n distinct roots t 1? . . . , t n , 
whence it must be the zero polynomial. Thus, hj = 0 for i > 1 and 
p^j,...,^) = h 0 = h 0 (s 1 ,...,s n ), as desired. I 

Example 6.2.1 Let p(x) = x n — SjX 11-1 H h(-l) n s n be a polynomial 

with roots r 1 ,...,r n in a splitting field. For k > 1, the polynomials 

u k = r* + rk + ... + rk 
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are symmetric in the roots of p(x), and so Theorem 6.2.2 implies that 
they can be expressed as polynomials in the elementary symmetric 
functions s 1 ,...,s n of the roots. One way to derive an expression 
relating the u k ’s to the s k ’s is by following the proof of Theorem 6.2.2. 
In the exercises, we ask the reader to take another approach to obtain 
the so-called Newton identities 

u k - u k-l s l + U k-2 S 2 + • • • + (- 1 ) k_lu i s k-i + (-^H = 0 

for k > 1. These identities can be used to compute recursively the u k ’s 
in terms of the s^s. 0 

Since any symmetric polynomial in the roots of a given polynomial 
p(x) is a polynomial in the coefficients of p(x) as well, it therefore lies 
in the base field. 

Corollary 6.2.3 Let p(x) 6 F[x] have roots r 1? . . . , r n in a splitting field. If 
f(ti,...,t n ) is a symmetric polynomial, then f(r 1? . . . ,r n ) is a polynomial 
in the coefficients of p(x), and thus lies in F. 

Proof. We know that f^,...,^) = g(s 1 ,...,s n ) where Sj is the i-th 
elementary symmetric polynomial in the roots r^...,^. But Sj or -Sj is 
the coefficient of x n ~ l in p(x), whence f is a polynomial in these 
coefficients. I 

Theorem 6.2.4 The elementary symmetric polynomials s 1 ,...,s n are 
algebraically independent over F. 

Proof. Since F(s 1 ,...,s n ) < F(t 1 ,...,t n ) is algebraic, Theorem 3.3.1 
implies that S = {s 1 ,...,s n } contains a transcendence basis for 
F(t 1 ,...,t n ) over F. But {t 1? . . . , t n } is a transcendence basis and so 
[F(t 1 ,...,t n ):F] t = n. Hence, S is a transcendence basis. I 

6.3 The Discriminant of a Polynomial 

We have seen that the Galois group G F (p(x)) of a polynomial of 
degree n is isomorphic to a subgroup of the symmetric group S n and 
that the Galois group of a generic polynomial is isomorphic to S n itself. 
A special symmetric function of the roots of p(x), known as the 
discriminant, provides a useful tool for determining whether or not the 
Galois group is isomorphic to a subgroup of the alternating group. 

Let p(x) be a polynomial over F, with roots r 1 ,...,r n in a splitting 
field E. Let 
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s = n ( r i — r i> 

i<j 

The discriminant of p(x) is A = 6 2 . Note that A / 0 if and only if p(x) 
has no multiple roots. 

Let us assume that A ^ 0. Hence p(x) is the product of distinct 
separable polynomials, implying that F < E is a Galois extension. Each 
< 7 E Gp(p(x)) acts as a permutation of the roots rj and so 

*6 = (-1 )"6 

where (-1)* is 1 if a is an even permutation and -1 if a is an odd 
permutation. Hence, a A = A, implying that A E F. If char(F) = 2, then 
<t6 = 6 for all a E Gp(p(x)) and so 6 E F. 

If char(F) ^ 2, we have two possibilities. If 6 E F then all & E 
Gp(p(x)) fix 6 and are therefore even. Hence Gp(p(x)) is isomorphic to 
a subgroup of the alternating group A n . If 6 £ F then Gp(p(x)) must 
contain an odd permutation. It is not hard to show that if a subgroup 
of S n contains an odd permutation then the subgroup has even order 
and exactly half of its elements are even. Hence, if 6 ^ F then Gp(p(x)) 
has even order and 

I Gf(p(x)) n A n I = i I (? F (p(x)) I 

If we let H = Gp(p(x)) fl A n then F( H) < E is Galois, with Galois group 
H and so 

[E:f(H)] = | H | = 1 1 <J F (p(x)) | = I[E:F] 

which implies that [E(H):F] = 2. But [F(<5):F] = 2 and F(6) C F(H), 
whence F(H) = F(5). In words, the fixed field of the even permutations 
in Gp(p(x)) is F(6). Let us summarize. 

Theorem 6.3.1 Let p(x) E F[x] have splitting field E. 

1) A = 0 if and only if p(x) has multiple roots in E. 

2.) Assume that A ^ 0 and char(F) ^ 2. 

a) If A has a square root in F, then the Galois group Gp(p(x)) is 
isomorphic to a subgroup of the alternating group A n . 

b) If A does not have a square root in F, then the Galois group 
Gp(p(x)) contains half odd and half even permutations of the 
roots of p(x). In addition, the fixed field of Gp(p(x))flA n is 

F(n/A). 

3) Assume that A ^ 0 and char(F) = 2. Then A has a square root in 
F, but Gp(p(x) need not be isomorphic to a subgroup of A n . 
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Proof. For part 3), observe that the generic polynomial 



g(x) = (x-t 1 )-.-(x-t n ) 

has Galois group S n over F(s 1 ,. . ,,s n ). I 

The usefulness of Theorem 6.3.1 comes from the fact that A can 
actually be computed in some cases. To see why this is so, observe that 



S is the Vandermonde determinant 






1 1 1 


1 


6 = 


r l r 2 


r n 




r?" 1 r 2 _1 ••• 


4T 1 


Taking the transpose and multiplying gives 






u 0 Uj ••• 


u n-l 


A = 


u i u 2 ••• 


u n 




u n— 1 u n •" 


u 2n-2 



where Uj = + x l 2 H h r^. Newton’s identities can then be used to 

determine the Uj’s in terms of the coefficients of the polynomial in 
question (see Example 6.2.1 and the exercises). We will see some 
examples of this in the next section. 



6.4 The Galois Groups of Some Small Degree 
Polynomials 

Quadratic Polynomials 

Quadratic extensions (extensions of degree 2) hold no surprises except 
perhaps for certain base fields of characteristic 2. Let 

p(x) = x 2 + bx + c = (x — r)(x — s) 

be a quadratic over F, with splitting field E. To compute the 
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discriminant, observe that = r + s = b and 



Hence 



u 2 = r 2 -f s 2 = (r + s) 2 - 2rs = b 2 - 2c 




2 b 
b b 2 -2c 



= 2(b 2 — 2c) — b 2 = b 2 — 4c 



a familiar quantity. 

If A = 0 then p(x) has a double root r and 



p(x) = (x - r) 2 = x 2 - 2rx + r 2 

The root r will lie in F for most well-behaved base fields F. In 
particular, if char(F) ^ 2, then -2r € F implies r E F. If char(F) = 2 
and F is perfect (a finite field, for example) then r € F. However, the 
following example shows that p(x) may have a multiple root not lying 
in F. Let F = Z 2 (t 2 ) where t is transcendental over Z 2 and let 

p(x) = x 2 - 1 2 = (x - 1) 2 

Since t £ Z 2 (t 2 ), this polynomial is irreducible over Z 2 (t 2 ), but has a 
multiple root t $ F. 

If A ^ 0 then p(x) has distinct roots and there are two possibilities: 
(i) the roots lie in F, p(x) is reducible and G F (p(x)) is trivial, or (ii) the 
roots do not lie in F, p(x) is irreducible and G F (p(x)) ~ Z 2 is generated 
by the map cr: r— >s. When char(F) ^ 2, we can tell whether or not the 
roots lie in F by looking at the discriminant, since the quadratic 
formula gives 

_ -b ± y/b 2 — 4c __ -b ± y/K 
’ 2 ~ 2 

Hence the roots lie in F if and only if A has a square root in F. 

Theorem 6.4.1 Let p(x) € F[x] have degree 2. 

1) If A = 0 then p(x) = (x-r) 2 has a double root r, which may or 
may not lie in F. In any case, G F (p(x)) is trivial. 

2) If A ^ 0 then p(x) has distinct roots and there are two 
possibilities: (i) the roots lie in F, p(x) is reducible and G F (p(x)) is 
trivial, or (ii) the roots do not lie in F, p(x) is irreducible and 
G F (p(x)) ~ Z 2 is generated by the map cni— +s. 

3) If char(F) ^ 2 then all quadratic extensions F < E have the form 
E = F( v /a), for some a £ F. 
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Proof. Part 3) follows from the fact that if A is the discriminant of 
mm(a,F) then part 2) implies that y/X ^ F, whence E = F(\/A). I 

Let us turn now to a more interesting case. 

Cubic Polynomials 

Let 

p(x) = x 3 + bx 2 + cx -I- d = (x — r)(x - s)(x — t) E F[x] 

have splitting field E. Then p(x) is irreducible if and only if none of its 
roots lie in F. Let us assume that p(x) is irreducible. A straightforward 
but lengthy computation gives 

A = -4b 3 d + b 2 c 2 + 18bcd - 4c 3 - 27d 2 

Assume first that A = 0. Then p(x) has multiple roots and Corollary 
1.6.4 implies that p(x) = q(x pk ), where p = expchar(F) and p k > 1. 
Since deg p(x) = 3, we must have p = 3, k = 1 and so 

p(x) = (x-r) 3 = x 3 -r 3 

has a single root of multiplicity 3. The extension F < F(r) = E is purely 
inseparable of degree 3 and the Galois group is trivial. 

If A 0 then p(x) has no multiple roots and is therefore separable. 
Hence, F < E is Galois and | Gp(p(x)) | = [E:F]. Since r ^ F, we have 
[E:F] > 1, which leaves the possibilities [E:F] = 3 or 6. If p(x) splits in 
F(r), then [E:F] = 3 and the Galois group is isomorphic to A 3 ~ Z 3 . If 
p(x) does not split in F(r), then [E:F] = 6, in which case the Galois 
group is isomorphic to S 3 . When char(F) ^ 2, these two cases can be 
distinguished by examining the discriminant. If y/X E F, then 
Gp(p(x)) ~ A 3 and if y/X ^ F then Gp(p(x)) ~ S 3 . 

Theorem 6.4.2 Let p(x) G F[x] be irreducible of degree 3. 

1) If A = 0 then p(x) has a single root of multiplicity 3 and 
char(F) = 3. The Galois group is trivial. 

2) If A ^ 0 then Gp(p(x)) ~ A 3 or S 3 . 

2) Let char(F) / 2. If 0 ^ y/X G F then Gp(p(x)) ~ A 3 and adjoining 
a single root of p(x) to F gives the splitting field for p(x). If 
y/X ^ F then Gp(p(x)) ~ S 3 . □ 

Example 6.4.1 Let p(x) = x 3 — 2x 2 — x + 1 over Q. Any rational root of 
p(x) must be ± 1 (Theorem 1.2.2) and so p(x) is irreducible. The 
discriminant is A = 49 which has a square root in Q and so 
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Gq(p(x)) ~ A 3 is cyclic of order 3. On the other hand, the irreducible 
polynomial q(x) = x 3 - x -f 1 has discriminant A = -23, which has no 
square root in Q. Hence, its Galois group is isomorphic to S 3 . D 

*Quartic Polynomials 

Since the Galois group of an irreducible quartic polynomial is 
isomorphic to a transitive subgroup of S 4 , we should begin by 
determining all such subgroups of S 4 . Theorem 0.2.21 implies that if G 
is a transitive subgroup of S 4 then | G | = 4, 8, 12 or 24. Here is a list. 

1) The cyclic group Z 4 occurs as a subgroup of S 4 , for instance 
((1234)) ~ Z 4 . 

2) The four group Z 2 x 2 2 occurs as a subgroup of S 4 . In particular 

V = {i, (12)(34), (13)(24), (14)(23)} 

is isomorphic to Z 2 x Z 2 and is known as the viergruppe. We leave 
it to the reader to show that V is normal in S 4 . This and the 
previous case exhaust all nonisomorphic groups of order 4. 

3) The dihedral group D 4 of symmetries of the square, thought of as 
permutations of the corners of the square, is a subgroup of S 4 of 
order 8. Since D 4 is a Sylow 2-subgroup of S 4 , all subgroups of S 4 
of order 8 are isomorphic to D 4 . 

4) The alternating group A 4 is the only subgroup of S 4 of index 2, 
that is, of order 12. 

5) Of course, S 4 is the only subgroup of S 4 of order 24. 

Let p(x) = x 4 + ax 3 + bx 2 -f cx + d be an irreducible quartic over F 
and let us assume that char(F) ^ 2, 3. This will insure that 4^0 and 
that all irreducible cubic polynomials that we may encounter are 
separable. Replacing x by x — a/4 will eliminate the cubic term, 
resulting in a polynomial of the form 

q(x) = x 4 + px 2 + qx + r 

The polynomials p(x) and q(x) have the same splitting field and hence 
the same Galois group, so let us work with q(x). Let E be the splitting 
field of q(x), let be its roots in E and let G = G F (E) be its 

Galois group. For convenience, we identify G with its isomorphic image 
in S 4 . 

The Quartic x 4 + bx 2 + c 

In order to get our feet wet, let us first consider the special case 
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q(x) = X 4 -f bx 2 -f c 

If we denote the roots of q(x) in E by ± a, ± /? then E = F(a,/?) and 
b = -(a 2 + /? 2 ), c = a 2 /? 2 
We define the associated quadratic to q(x) to be 

q(x) = x 2 + bx + c 

The roots ot 2 and /? 2 of q(x) are given by 




The irreducibility of q(x) can be determined as follows. Certainly if 
q(x) is reducible over F, then so is q(x). On the other hand, if q(x) is 
irreducible then its roots a 2 and /? 2 do not lie in F, whence q(x) cannot 
have a linear factor over F and, if reducible, must have the form 

q(x) = x 4 + bx 2 + c = (x 2 + ux + v)(x 2 - ux + w) 

where, as seen by equating coefficients, u(v — w) = 0. However, if u = 0 
then 

q(x) = (x 2 + v)(x 2 + w) 

which gives 

q(x) = (x + v)(x + w) 

contradicting the irreducibility of q(x). Thus, u ^ 0 and v = w. We can 
summarize as follows: 

1) If vb 2 — 4c G F then q(x), and therefore q(x), is reducible. 

2) If y/b 2 -4c ^ F then q(x) is reducible if and only if it has the 
form 

q(x) = x 4 + bx 2 + c = (x 2 + ux + v)(x 2 — ux + v) 
where v 2 = c and 2v — u 2 = b. 

For example, let q(x) = x 4 + 6x 2 + 4 over Q. Then b 2 — 4c = 20 and 
V^O ^ Q. From 2) above we have 



v 2 = 4, v = ± 2 




